Gene Sros_3075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_3075 
Symbol 
ID8666362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp3355418 
End bp3356713 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content66% 
IMG OID 
Productsugar ABC transporter periplasmic sugar-binding protein 
Protein accessionYP_003338768 
Protein GI271964572 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.111769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCAC CCAGGACGGC GTTAGCGGCA GGGCTGCTGG CGCTGGCGGT TCTCACCACC 
GGGTGCGTGG CGGGCACCTC CGCCGGCGCC CCCTCCGCCG CGGCCGACCA GCCGTTCGAG
GGTGAGGTCG AATTCTGGAC GATCAACCTG AAGAAGAACT TCAACGACTA TGTCACCGGG
CTGATCACCC AGTACCAGAA GGAACACCCC AAGGTCACCG TCAAGTGGGT GGACGTGCCC
GGCCAGGACA GCGCGACCAA GCTGCTCGCG GCCATGGCCA GCGGTGACGT GCCCGACGCG
GTCAACCTGG GCTCTCCCGA CATCGGCAGG TTCATCCCGT CGCTGGCGCC GATGGACGAC
TACTTCAAGC CGGAGGACCT CGCCGACTTC CAGCCGAACC TGGTGGAGCC GCTGCGCCAG
GACGGCAAGC TCTACGGGGT GCCCTGGTAC AACGGCGGCG CCCCGGTGGC GATGTACCGC
AAGTCGGTCG TGTCCAAGGC CGGCTTCGAC GAGAAGGCGC CGCCGAAGAC CTACGACGAG
GCACTGGACC TGGCGGCCAA GGTCTACGAC GAGACCAAGG TCTACGGCAT CAACGAGATC
CCCGGGCCGT CCGTCGTCTC CGTGCTGCGC TACTACGGGG TCACGCTGCT GTCGGAGGAC
AGGAAGAAGG CGGCGTTCAA CACCCCCGAG GTCGCCGCGA TCATCGAGAG GTTCAAGAAG
AGCTACGACG AGCACGGCAT CGCGCCGGGC TCCGTCTCCA AGGACGTCCG CGCCCTTCCG
CAGAGCCTCG ACAACGGCCA GGTCGCCTTC ACGGCCAGCG CCAACGGCTC GACCCTGGTC
AACATCCAGA AGAACGCCCC CGACATCTAC AAGGACCTCG TCGTCACCGA GCCCGTCCGG
ACGGCCGGCG GCGGCTACCT GCTCAACGCC CAGCAGACGT TCACGATCCC CAAGGCCTCC
AAGCACAAGA AGGCGGCGGC CGAGTTCATC AAGTTCTTCA CCAACGGCGC CAACCAGCTC
GCCTTCTGCA AGATCGTGCC GATCTACCCG TCGACGATCT CCTCGACGAA GGACGCCTTC
TTCACCGGCA CCGGCGGCAC CGAGCCGATG GACGTCGCCC GCCAGGTGAT CGTCAAGGGG
CTGCCGAAAC TGGAGTACAC ACCGATGGGC ACGGCCAAGG ACACCGAGCT CGCCGAGTCC
CTGGCGGAGG AGATCCGCGC CGTGTTCCAG GGACAGAAGA GCGTGAAGGA CGCGCTCGAC
ACCGCAGAGA AGAATTGGAA TGACGCTCTT GTCTAA
 
Protein sequence
MTPPRTALAA GLLALAVLTT GCVAGTSAGA PSAAADQPFE GEVEFWTINL KKNFNDYVTG 
LITQYQKEHP KVTVKWVDVP GQDSATKLLA AMASGDVPDA VNLGSPDIGR FIPSLAPMDD
YFKPEDLADF QPNLVEPLRQ DGKLYGVPWY NGGAPVAMYR KSVVSKAGFD EKAPPKTYDE
ALDLAAKVYD ETKVYGINEI PGPSVVSVLR YYGVTLLSED RKKAAFNTPE VAAIIERFKK
SYDEHGIAPG SVSKDVRALP QSLDNGQVAF TASANGSTLV NIQKNAPDIY KDLVVTEPVR
TAGGGYLLNA QQTFTIPKAS KHKKAAAEFI KFFTNGANQL AFCKIVPIYP STISSTKDAF
FTGTGGTEPM DVARQVIVKG LPKLEYTPMG TAKDTELAES LAEEIRAVFQ GQKSVKDALD
TAEKNWNDAL V