Gene Sros_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_4040 
Symbol 
ID8667334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp4497413 
End bp4498720 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content72% 
IMG OID 
Productmajor facilitator transporter 
Protein accessionYP_003339691 
Protein GI271965495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.248694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0016848 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGACCGCCA CCGAAACCAG GACGCTGCCC ATCGGCGACG CGTTCGACCG GATGCCGTTC 
ACCCGCAGGC ACGTTCTGAT CGCGCTGGCG CTGTTCGTCG CGTTCGTCAT CGAGTCCTGG
GAGCAGCTCG CCCTCATCTA CGTGTCCGCG GACCTCGGCA CGGCCTTCGG CCTCGACGAG
GGCGGGATCG GGCTGGTGCT GTCGGCCGTC GCGTTCGGCA TGATCCCCGG CGCGCTGATC
TGGGGGCCGG TCGCCGACCG GATCGGCCGC CGGCCCGCCT GCGTCTGGTC CCTGGCCGCC
TACGGGGTGA TCGCGCTGGC CTCGGCGTTC GCGCCGAACG TCGAGACCCT CGTGGCGCTG
CGAGTGGCCT CCGGGCTCGC GCTGGCGGGC GTCTACACCG TCACCTTCCC CTACTTCCTG
GAGCTGCTGC CCACCAGGAG CAGGGGCAGG GCGGCGGTCT ACCTGTCGAT CGGCTGGCCG
ATCGGCATGC TGGCCGCCAT CGGCGCCTCG GTCTGGCTGG GCGACCTCGG CTGGCACGTG
GTCGTCATCG CCAGCGCGGT GGCGGGCCTG TGGGCGTTCG CGATCAGGGC CTGGGTGCCC
GAGTCGCCCT ACTGGCTGGC CGCGAGGGGC CGCCAGGACG AGGCCCGGGC GGTGCTGCGC
CGGCTGGGCA GCCCCGACGC CGACGCGGTC TTCACGGTCG CCACCGAGCG CGCCGGTCAC
CCGCTGGACC TGCTGCGCGG CAGGCTCCGC AGGATCACGG TGCTGATGCT GCTGCTGAAC
TTCGCCTTCA ACTGGGGCTA CTGGGGCCTG CAGACCTGGC TGCCCACGCT GCTGCAGGAG
AAGGGGCTGA GCATGGACGC GAGCCTCGGC TTCGCGGCGC TCAGCGCCCT CATGATGATC
CCGGGCTACG TCAGCGCGTC GCTGCTCACC GGCCGTTTCG GCCGCAAGAA GGTCTTCCTG
GTCTACGTGG TGGCCGCGGC CCTCGGCGGG CTGGGCTTCG CCACCGCGTC CACGATGACC
GGCCTCTACG TGGGCAACTT CGTCCTGTCG TTCTTCAGCC TGGGCGCCTG GGGCGTGTGG
AACACCTGGA ACGGCGAGTT CTACCCGACC GCGCTGCGCG GCACCGGCTA CTCCTGGGCG
ACCGCCTCCC AGCTCGTGGC CACCACCGTC GCCCCGTCGG CGGTGGGGAT GCTGCTCGCC
CACGCCACCG GCTTCACCGC GACCATGCTG GTGATCAACG CGTTCATGGT GGTGACGGCG
CTGCTGGCCG TACCGCTGCC GGAGACCGAG GGGCGCGGCC TGGAATGA
 
Protein sequence
MTATETRTLP IGDAFDRMPF TRRHVLIALA LFVAFVIESW EQLALIYVSA DLGTAFGLDE 
GGIGLVLSAV AFGMIPGALI WGPVADRIGR RPACVWSLAA YGVIALASAF APNVETLVAL
RVASGLALAG VYTVTFPYFL ELLPTRSRGR AAVYLSIGWP IGMLAAIGAS VWLGDLGWHV
VVIASAVAGL WAFAIRAWVP ESPYWLAARG RQDEARAVLR RLGSPDADAV FTVATERAGH
PLDLLRGRLR RITVLMLLLN FAFNWGYWGL QTWLPTLLQE KGLSMDASLG FAALSALMMI
PGYVSASLLT GRFGRKKVFL VYVVAAALGG LGFATASTMT GLYVGNFVLS FFSLGAWGVW
NTWNGEFYPT ALRGTGYSWA TASQLVATTV APSAVGMLLA HATGFTATML VINAFMVVTA
LLAVPLPETE GRGLE