Gene Sros_2194 details


Gene Information

Locus tag: Sros_2194
Symbol:
ID: 8665476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 2359477
End bp: 2360808
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003337920
Protein GI: 271963724
COG category:
COG ID:
TIGRFAM ID:
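
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in the record, and the function name `check_lengths` is hypothetical.

```python
def check_lengths(start_bp: int, end_bp: int, gene_len_bp: int, protein_len_aa: int) -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    # Assumption: coordinates are 1-based and inclusive.
    span = end_bp - start_bp + 1
    # Assumption: the gene length includes the stop codon, so the
    # protein has one residue fewer than the number of codons.
    codons, remainder = divmod(gene_len_bp, 3)
    return span == gene_len_bp and remainder == 0 and codons - 1 == protein_len_aa


# Values taken from the record: 2360808 - 2359477 + 1 = 1332 bp,
# and 1332 / 3 = 444 codons, i.e. 443 residues plus one stop codon.
print(check_lengths(2359477, 2360808, 1332, 443))  # True
```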


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAATCA CGAAGATCGC GGTTACGACT GCTACCACCG CCGCCCTGGC CCTGGGCCTC 
GCCGCATGTG GTTCCACCGA CGAGCCCGCC AAGAGCGCCG ACCCCCAGGC AGGCGCCTCC
ACCGCCGCCT CCGGCCCGAA GTACGCCGGC CAGACCCTGA CCGTGTGGCG TCTCGGCGAC
AGCAACCCGG CCGCCCAGAA GTACATGGAC GAGCTGAACG CGGCCTTCGA GAAGGAGTCC
GGCGCCAAGA TCAAGCTTGA GTGGATCCCG TGGCCCCAGG TGAACGACAA GTTCACCGCC
GCCGCGGCGG GCGGCGCCGG CCCGGACGTC ACCGAGATCG GCAACGACCA GGTGCCGCTC
TGGCAGAGCC AGGAAGCCCT CTCCCCGGTC ACCCCGCTGG CGGAGGCCGG CGACCAGGCC
CAGATTCCGA AGAACCTGCT CGGCCTGGAG ACCATCGACA ACGAGGTCTA CGCCCTGCCC
TGGGGCGCCG GCTCCCGCGC GGTGCTCTAC CGCAAGGACT GGTTCAAGGA CCTCGACATC
GAGGTTCCGA AGACCTGGGA CGAGCTGGTC GCCGCGGCCA AGAAGATCCA GGAGAAGAAG
GGCAAGGACG TCGACGGCTT CGCCTTCAAC GGCGGCTCCG ACGCCAACCA CCTGCTCGGC
GCGTTCGCCT GGTCCGAGGG TGGCGAGTAC GCCCTCAAGG AGGGCGGCAA GTGGGTCGGC
AAGGTGACCG ACCCCAAGTT CAAGGCCGGT TTCGCCACCT ACACGAGCCT GGTCACCGAC
GGCCTGTCCG GTAAGTCGCG CCTGACGCAG AACACGGTCG ACATCCGCAA GCGGTTCGCC
AACGACAAGG TCGGCATGTA CCTGACGGCC GGCTGGGACC TGCCCGGCAT CGAGGTGGAC
AGCAAGGGCA AGCTGAAGGC CGACAAGCTG GCCTTCTTCC CGCTCCCGGC CAAGGCCGGC
GGCGAGGCCC CGTCCTTCTT CGGCGGCAAC GACATCGCCA TCTGGGACAG CGCCAAGAAC
AAGGAGCTCG CGGCCGAGTA CCTGAAGCTG GCCACCAACA AGGAGTGGGC GGAGCGTTAC
GCCCTTGAGG GCGGCCTGCT CCCGATCTAC CCCGAGGCTC TGGCGAAGCT GAGCTCGGAC
CCGGCGCTGG CCCCGTTCGC GGCGGCCTTC GCCAAGGCCA AGGCCTTCCC GGCCGACTCC
AACTGGACCG AGGCCAACGA GACCAAGGCC GTGCTGCAGA ACGCCGCGCG GTCCGTCATC
GAGGGCAAGG CCACCCCTGA CGAGGCTCTC ACCAAGGCCA ACGGGGAAAT CGAAGAGATC
CTGAACCAGT AG
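
A GC content figure like the one reported in the Gene Information section can be recomputed from a sequence listing in this layout. This is a minimal sketch, not the database's own method: the helper name `gc_fraction` is hypothetical, and it simply strips the spaces that separate the ten-base groups before counting.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used to group bases in the listing.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Only the first row of the gene sequence is used here; pass the full
# listing to compare against the GC content reported for the gene.
first_row = "GTGAAAATCA CGAAGATCGC GGTTACGACT GCTACCACCG CCGCCCTGGC CCTGGGCCTC"
print(f"{gc_fraction(first_row):.1%}")
```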
 
Protein sequence
MKITKIAVTT ATTAALALGL AACGSTDEPA KSADPQAGAS TAASGPKYAG QTLTVWRLGD 
SNPAAQKYMD ELNAAFEKES GAKIKLEWIP WPQVNDKFTA AAAGGAGPDV TEIGNDQVPL
WQSQEALSPV TPLAEAGDQA QIPKNLLGLE TIDNEVYALP WGAGSRAVLY RKDWFKDLDI
EVPKTWDELV AAAKKIQEKK GKDVDGFAFN GGSDANHLLG AFAWSEGGEY ALKEGGKWVG
KVTDPKFKAG FATYTSLVTD GLSGKSRLTQ NTVDIRKRFA NDKVGMYLTA GWDLPGIEVD
SKGKLKADKL AFFPLPAKAG GEAPSFFGGN DIAIWDSAKN KELAAEYLKL ATNKEWAERY
ALEGGLLPIY PEALAKLSSD PALAPFAAAF AKAKAFPADS NWTEANETKA VLQNAARSVI
EGKATPDEAL TKANGEIEEI LNQ
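
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates this for the first 30 bases. It is an illustrative re-implementation, not the tool that produced this record: the names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical, and only the three most common bacterial initiator codons are handled (table 11 lists further, rarer ones).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Table 11 uses the same
# codon assignments as the standard code; it differs in its start codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: only the common initiators are treated as start codons here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            # An initiator codon is read as methionine, whatever it
            # encodes internally (GTG is valine elsewhere).
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base groups of the gene sequence.
print(translate_cds("GTGAAAATCA CGAAGATCGC GGTTACGACT"))  # MKITKIAVTT
```

The output matches the first ten residues of the protein sequence, including the leading M that results from the GTG start codon.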