Gene Sros_9103 details

Gene Information

Locus tag: Sros_9103
Symbol:
ID: 8672449
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptosporangium roseum DSM 43021
Kingdom: Bacteria
Replicon accession: NC_013595
Strand:
Start bp: 10048478
End bp: 10050211
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: thiamine pyrophosphate protein domain protein TPP-binding protein
Protein accession: YP_003344469
Protein GI: 271970273
COG category:
COG ID:
TIGRFAM ID:
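
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 10048478
END_BP = 10050211
GENE_LENGTH_BP = 1734
PROTEIN_LENGTH_AA = 577


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1734
print(expected_protein_length(GENE_LENGTH_BP))  # 577
```

Under these assumptions both derived values agree with the reported Gene Length and Protein Length.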


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCACCG TTGCGGAGCA GTTCGTCGAG GTGTTGCGCC AGGCCGGGGT GGAGCGGGTC 
TACGGGGTGG TGGGCGACAG CCTGAACCCG GTCGTGGACG CGATCCGCAA GACCGGCGGC
ATCGAGTGGG TGCACGTGCG CAACGAGGAG GCGGGGGCGT TCGCCGCCGC GGCCGAGGCG
CAGATCACCG GCCGCCTGGC GGTCTGCGCG GGGAGCTGCG GACCCGGTAA CACGCACCTG
GTCCAGGGGC TCTACGACGC CCACCGGAGC GGCGCCCCGG TGCTCGCCCT TGCCTCGCAC
ATCTCCAGCG CGCAGATCGG CACGGGGTTC TTCCAGGAGA CCCACCCGGA CCGCCAGTTC
GTGGACTGCA GCGGCTACTG CGAGATGATC AGCAGCGCCG AGCAGATGCC CCGGGTGCTG
CGCATCGCCG TCCAGCACGC GATCGGCCAC AGCGGCGTCG CGGTCGTGGT GCTCCCCGGC
GACGTGGCGG ACCTGCCGGC CGCGCGCGGC ACCGGCACCC ACGAGTTCCT CACCCGGCAG
GGCACGATCC GGCCCCTTTC CGACCAGGTG GCGGAGCTGG CCACGGCGCT GAACAGCGCG
GAGAAGGTGA TGCTGTTCTG CGGCGCCGGG GTGCGCCGCG CGCACGAGGA GGTCATGTCA
CTGGCCGCCC GCACGCTGGC CCCGGTCGGG CACGCGCTGC GCGGCAAGGA GTGGATCCAG
TACGACAACC CGTACGACGT GGGGATGAGC GGGCTGCTCG GCTACGGCGC GTGTTACGAG
GCCATGCACG AGGCCGACCT GGTGGTGCTG CTCGGCACCG ACTTCCCCTA CGACGACTTC
CTGCCGGGCA GGCGGACGGT CCAGATCGAC CACGACCCCG CGCAGCTGGG CCGCAGGACC
CCGCTGGAGC TGGCCGTGCA CGGCGACGTC CGCGAGACGC TGCTCGCGGT GCTGCCCCAG
GTCGCGCAGA AGACGGACCG GCGCTACCTC GACAAGATGC TGTCCAAGCA CGTGAAGACG
CTGGACAACG TCGTGAACGC CTACACCCGC GACATCGAGC ACCACACACC GATCCATCCG
GAGTACGTGG CGAGCGTCGT GGACGAGCTC GCCGCCGACG ACGCGGTGTT CACCGTTGAC
ACCGGCATGT GCAACGTCTG GGCGGCGCGC TATCTCACTC CCAACGGCCG CCGCAGGGTG
ATCGGCTCCT TCAAGCACGG GAGCATGGCC AACGCGCTCC CGCACGCCAT CGGCGCGCAG
CTCGCCGGCC GCGGGCGGCA GGTCGTCTCG CTCTCCGGCG ACGGCGGGCT CGGCATGCTC
ATGGGCGAGC TCCTCACCGC CCGGATGTAC GACCTGCCCG TCAAGATCGT GGTGTTCAAC
AACTCCTCGC TCGGCATGGT GAAGCTGGAG ATGCTGGTCG ACGGGCTGCC CGACTTCGGC
ACCGACGTCG CGCCCGTCGA CTACGCGGCG ATCGCCGCCG CGATCGGGCT GGGGTCGGTC
CGGGTGGAGA AGCCCGCGCA GGTCCGGGAG GCGCTGGCCA CCGCCTTCGC GGCGCCGGGG
CCGTACCTGG TGGACGTGGT CACCGACCCC GACGTGCTCT CCATGCCGCC GCGCATCACC
GCCAAGCAGG TCAAGGGGTT CGCTCTGGGG GCGGGGAAGG TCGTGCTGAC CGGCGGGGTG
GGACGCATGA TCGACATGGC CAGGGCGAAC CTGCGGAACA TCCCGCGCCC GTGA
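
The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any analysis. The sketch below shows one way to strip the whitespace and compute a GC fraction. For brevity it uses only the first line of the sequence as input, so its result describes those 60 bases and not the whole gene; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence only (illustrative input).
first_line = "ATGGGCACCG TTGCGGAGCA GTTCGTCGAG GTGTTGCGCC AGGCCGGGGT GGAGCGGGTC"
seq = clean_sequence(first_line)

print(len(seq))                     # 60
print(round(gc_fraction(seq), 2))
```

Passing the full multi-line gene sequence to `clean_sequence` would give a string whose length can be compared with the reported Gene Length, and whose GC fraction can be compared with the reported GC content.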
 
Protein sequence
MGTVAEQFVE VLRQAGVERV YGVVGDSLNP VVDAIRKTGG IEWVHVRNEE AGAFAAAAEA 
QITGRLAVCA GSCGPGNTHL VQGLYDAHRS GAPVLALASH ISSAQIGTGF FQETHPDRQF
VDCSGYCEMI SSAEQMPRVL RIAVQHAIGH SGVAVVVLPG DVADLPAARG TGTHEFLTRQ
GTIRPLSDQV AELATALNSA EKVMLFCGAG VRRAHEEVMS LAARTLAPVG HALRGKEWIQ
YDNPYDVGMS GLLGYGACYE AMHEADLVVL LGTDFPYDDF LPGRRTVQID HDPAQLGRRT
PLELAVHGDV RETLLAVLPQ VAQKTDRRYL DKMLSKHVKT LDNVVNAYTR DIEHHTPIHP
EYVASVVDEL AADDAVFTVD TGMCNVWAAR YLTPNGRRRV IGSFKHGSMA NALPHAIGAQ
LAGRGRQVVS LSGDGGLGML MGELLTARMY DLPVKIVVFN NSSLGMVKLE MLVDGLPDFG
TDVAPVDYAA IAAAIGLGSV RVEKPAQVRE ALATAFAAPG PYLVDVVTDP DVLSMPPRIT
AKQVKGFALG AGKVVLTGGV GRMIDMARAN LRNIPRP