Gene Sros_1494 details


Gene Information

Locus tag: Sros_1494
Symbol:
ID: 8664770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 1577657
End bp: 1578973
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003337230
Protein GI: 271963034
COG category:
COG ID:
TIGRFAM ID:
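
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon (the gene sequence on this page ends in TGA); the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
start_bp = 1577657
end_bp = 1578973

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1317 bp
print(protein_length, "aa")  # 438 aa
```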


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCAC GCTCTAAGCA CCTGGCGGTC CTGGCCACGG CGCTCATGCT CGGCACCGGA 
CTCGCCGCGT GCGGGTCGGA CGACGCGGGT TCCCCCGGCG GCAAGACCAC GATCACCTTC
TGGGACGACA ACGGCGGCCC CGCCCGCACC CCGGTGTGGC AGCACATCAT CGCCGAGTTC
CAGAAGGCGA ACCCGACGAT CACCGTCAAG TACGTCGGCA TCCCGATCGC CCAGGCGCAG
CAGAAGTACG ACACCGCCAT CGCCGGCGGC GGCCTGCCCG ACGTGGGCGG TGTCTCCACC
GCGATGCTGT CCAGCCTGGT GGCGCAGAAG GCGCTGGAGC CGCTGGACGG CCGGATCTCC
GGCGGGGCGC TGAACGGCAG GCTGAACGAC CAGGTCGTCA CGTCCGTCAA GGCCACCGTG
CCCGACGGCA AGACCTACAT GGTGCCGATG TCCACCAACA TGGGCGTGTT CTGGTACCGG
ACGGACTGGT TCGGGGAGGC CGGGCTGGAG CCGCCCCGGA ACTGGGGCGA CTTCTTCACC
GCGACCGAGA AGCTGACCGA CACCGCCAAG AACCGCTACG GCTTCACCAT CCGCGGCGGC
GCGGGCTCGA TCGCGCAGAT GCTGGAGGTG GTGTACGGCC AGTCCGGCAT CACCGAGATC
TTCGACGCCG ACGGCAAGGC GACGGTCAAC GACCCGAAGA ACGTCGCGGC CCTGGAGAAA
CTGGCCGGGC TCTACAAGAA GGTCACCCCC GAGGCGGACG TCAGCAACGA CTACGTGAAG
ATGGTCGCCC AGTTCGACGG CGGCAACATC GCGATCATGC AGCACAACCT GGGCTCGTTC
AACGACCACG TCAAGACGCT GGGCAAGGAC AAGGTGGCGG CCATGGCCGT GCCGAAGTCC
GACGGCGGCG TCCAGGCGAT CCTGTCCAAC CCGGTCTCGG GGATCGGGCT GTTCGCGAGC
GGCGAGAAGA AGGACGCCGC CTACAAGTTC GCCGAGTTCG CCGCGTCCAA GGCGATGAAC
AGCCACTGGG CGGAGAAGAC CGGCGTGCTC CCGGCCAACA CCGAGGTGAA CGGCGAGGCC
TGGATCCAGG GGCTGCCGCA CATCGCCGAG GCGGTCAAGG TGCTCAACGA CCCGGCGACC
AAGGTCGTGC AGATGCCGTA CTACCTGCCG GAGTTCAACG CGATCACCAA GACCGACATG
GAGCCGGAGT TCCAGAAGGT CCTGCAGGGC ACGCTGCCCG CCAAGGACTT CCTGGACGCG
TTCGCGGCCA AGCTGACCGA GGCGCAGGCC TCCTACAAGC AGCGCAACGG CGGCTGA
 
Protein sequence
MTARSKHLAV LATALMLGTG LAACGSDDAG SPGGKTTITF WDDNGGPART PVWQHIIAEF 
QKANPTITVK YVGIPIAQAQ QKYDTAIAGG GLPDVGGVST AMLSSLVAQK ALEPLDGRIS
GGALNGRLND QVVTSVKATV PDGKTYMVPM STNMGVFWYR TDWFGEAGLE PPRNWGDFFT
ATEKLTDTAK NRYGFTIRGG AGSIAQMLEV VYGQSGITEI FDADGKATVN DPKNVAALEK
LAGLYKKVTP EADVSNDYVK MVAQFDGGNI AIMQHNLGSF NDHVKTLGKD KVAAMAVPKS
DGGVQAILSN PVSGIGLFAS GEKKDAAYKF AEFAASKAMN SHWAEKTGVL PANTEVNGEA
WIQGLPHIAE AVKVLNDPAT KVVQMPYYLP EFNAITKTDM EPEFQKVLQG TLPAKDFLDA
FAAKLTEAQA SYKQRNGG
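
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first line of the gene sequence using only the Python standard library. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation codons (table 11 differs in the start codons it permits); `CODON_TABLE` and `translate` are hypothetical names introduced here, not part of any tool behind this record.

```python
from itertools import product

# Codon table in the compact NCBI layout: codons enumerated over T, C, A, G
# with the third base varying fastest, paired with one amino acid letter each.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence as shown on this page (60 bases, 20 codons).
first_line = "ATGACAGCAC GCTCTAAGCA CCTGGCGGTC CTGGCCACGG CGCTCATGCT CGGCACCGGA"
print(translate(first_line))  # MTARSKHLAVLATALMLGTG
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, since every full sequence line holds 60 bases and therefore keeps the reading frame.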