Gene Sros_3971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3971 
Symbol 
ID8667265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4425414 
End bp4427003 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content68% 
IMG OID 
Productmonocarboxylic acid permease 
Protein accessionYP_003339624 
Protein GI271965428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0691283 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.470061 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGAGC ACCTGACCGA GATCATTGTC TTCTCCACAC TGTTCCTGCT CGTCAGCGGC 
ATGGGCTTCG TGGCCGCGCG CTGGCGCCGG CCGGACAACC TGGCCACCTT GGACGAGTGG
GGGCTGGGCG GCCGGAGTTT CGGTCCCTGG ATCACCTGGT TCCTCGTCGG CGGCGACCTC
TACACCGCCT ACACCTTCGT GGCCGTGCCC GCCCTGCTCT GGAGCGCGGG CGCGATGGGC
TTCTTCGCCG TGCCGTACAC GATCGTGGTC TATCCGATCG TGTTCCTGGT GCTGCTGCGA
CTCTGGTCGG TCTCGCACGT GCACGGGTTC GTGACCCCGG CCGACTTCGT CCGGGCCCGG
TTCGGCTCCC CCACGTTGGC GCTGCTGATC GCGATCACCG GCATCGTCGC GACGATGCCC
TACATCGCGC TCCAGCTCGT CGGCATCGAG GCCGTACTGA AGTCGATGGG GGTGACCGGC
CACCTGCCGA TCATCATCGC GTTCGCCATC CTGGCCGCCT ACACCTACCA GTCGGGCCTG
CGCGCCCCGG CGCTGATCGC CTTCGTCAAG GACACGCTGA TCTACATCGT GATCCTGGTC
GCGATCATCG TCATCCCGGC CAAGCTGGGA GGCTGGGGCA CGATCTTCGA CGACGCCCAG
GCCAAGTTCG CCGCCACGCC CGCGCCGGGG GACGGCATCC TGCTCAACGC CGGCAACCAG
CTCCAGTACG TCACGCTGGC CCTGGGATCG GCGCTCGCGC TGTTCCTCTA CCCGCACAGC
ATCACCGGCG TGCTGGCCTC ACGCAACCGC GATGTGATCA AGCGGAACAT GTCCGCGCTC
CCCGCCTACA GCCTGCTGCT CGGCCTGATC GCGTTGCTCG GCTACATGGC CATCTCGGCC
GGGGTCAAGC CCATCGGCAC GGACAACAAC ACGATCGTGC CCCAGCTGTT CGACAAGATG
TTCCCCGACT GGTTCACCGG CGTCGCCTAC GCCGCGATCG GCATCGGCGC GCTGGTCCCC
GCGGCGATCA TGTCGATCGC CGCGGCGAAC CTGTTCACCC GCAACATCTA CAAGGAGTAT
CTGAAGCCGG CCGCCAGCGA GGCCGACGAG GCCCGCGTCT CGAAGATCAC CTCACTGCTC
GTCAAGATCG GCGCGGTGCT GTGCATCCTG TTCCTGGACA CCGGCTTCTC CATCGACCTC
CAGCTCATCG GCGGCGTCAT CATCCTGCAG ACGCTCCCGT CGGTGGCGCT CGGCCTCTAC
ACCCGCTGGT TCCACCGGAT CGGCCTCATC GCCGGATGGG CGGGAGGCAT GGCCGCCGGG
ACGCTCCTGC TCTACAACAT CGGCAACCCG GCCACCGGCA AGCTGCACTT CGCCGGATCG
GCGTTCCCCC TGGAGAAGCT GGGCCTGGAC ACCAAGATGA CCATCTACGC GGGCGTCCTC
GCCCTGGCCG TCAACCTGAT CGTCGCCGCC GTCGCCACGC TCATCGCCCG CGGCGCCAAG
GCGTCCGAGG GTGACGACGC CACCCGGCCC GACCACTACC TCGCCGACGA GGGCGACCCC
CGCATCAAGG ACCTCGACCT CACCCACTGA
 
Protein sequence
MSEHLTEIIV FSTLFLLVSG MGFVAARWRR PDNLATLDEW GLGGRSFGPW ITWFLVGGDL 
YTAYTFVAVP ALLWSAGAMG FFAVPYTIVV YPIVFLVLLR LWSVSHVHGF VTPADFVRAR
FGSPTLALLI AITGIVATMP YIALQLVGIE AVLKSMGVTG HLPIIIAFAI LAAYTYQSGL
RAPALIAFVK DTLIYIVILV AIIVIPAKLG GWGTIFDDAQ AKFAATPAPG DGILLNAGNQ
LQYVTLALGS ALALFLYPHS ITGVLASRNR DVIKRNMSAL PAYSLLLGLI ALLGYMAISA
GVKPIGTDNN TIVPQLFDKM FPDWFTGVAY AAIGIGALVP AAIMSIAAAN LFTRNIYKEY
LKPAASEADE ARVSKITSLL VKIGAVLCIL FLDTGFSIDL QLIGGVIILQ TLPSVALGLY
TRWFHRIGLI AGWAGGMAAG TLLLYNIGNP ATGKLHFAGS AFPLEKLGLD TKMTIYAGVL
ALAVNLIVAA VATLIARGAK ASEGDDATRP DHYLADEGDP RIKDLDLTH