Gene Sros_5566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5566 
Symbol 
ID8668860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6086601 
End bp6088439 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content73% 
IMG OID 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_003341061 
Protein GI271966865 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTGA CAGTGGCCCA GGCGCTCGTG CGCTTCCTCG CCCAGCAGTG GACCGAACGC 
GACGGCGCCG AGCAGCGGCT GATCGCCGGA TGCCTGGGCA TCTTCGGCCA CGGCAACGTC
GCCGGCATCG GCCAGGCCCT CGCCGAACAG GGCGCCGGCA CCCACGCGGC CCTGCCCTAC
TACCTGGCCC GCAACGAACA GGCCATGGTG CACACCGCCG TCGGCTACGC CCGTACCCGC
GACCGTCTGA CCACCTTCGC CTGCACCACC TCCATCGGGC CCGGCGCCAC CAACCTCGTC
ACCGGCGCCG CCCTGGCCAC CATCAACCGC ATCCCGGTGC TGCTGCTGCC CGGCGACGTC
TTCGCCACCC GCGCCGCCTC CCCGGTGCTG CAGGAGCTGG AAGACCCCCG CTCCTACGAC
GTCACCGTCA ACGACACCCT GCGGCCGGTC AGCCGCTTCT TCGACCGGAT CAACCGCCCC
GAGCAGTTGC CCTCGGCACT GCTGGCCGCG ATGCGGGTGC TGACCGACCC GGCCGAGACC
GGCGCGGTCA CCCTGGCCCT GCCCCAGGAC GTCCAGGCCG AGGCCTACGA CTGGCCCGAG
GAGCTGTTCC GCCGGCGCGT CTGGCACGTG GCCCGCCCGG TGCCCGAACC CGCCGCGCTG
GCCCGCGCCC AGGACCTGCT GCGGCGATCA CGCCGGCCGC TGATCGTGGC CGGCGGCGGC
GTCAAACACA GCCAGGCCTC GCATCAGCTG GCCGCCTTCG CCGCCCGCCA CCGCATCCCG
GTCACCGAGA CCCAGGCAGG CAAGGGCGCC GTGCCGTACG ACCACCCGTA CGCCGCCGGC
GCGATCGGCC ACACCGGCAG CGCCGCGGCC AACACCCTGG CCCGCGAGGC CGACCTGGTC
ATCGGCATCG GCACCCGCTA CAGCGACTTC ACCACCGCCT CACGCACGCT GTTCGCCGGC
GCGGCCTTCC TCAACATCAA CATCACCGCC TTCGACGCGG CCAAGCACTC CGGCCAGATG
CTCGTCGCCG ACGCCCGCCA GGCGCTGGAC GCGCTGGAGC CCGGCGACTG GAACGCCGAC
CCCGCCTGGA GCGCCAGAGC CACCGAGCTG ACCCGCGACT GGCAGGCCGA GATCGAGCGC
GCCTACGGGG GGACGGAGCT GACCCAGCCG GTGATGCTGG GAATCGTCAA CCAGGCCGCC
GAGGGCGGCG TGGTGGTCAA CGCGGCCGGG TCCATGCCCG GCGACCTGCA CAAGCTGTGG
CGGGCCACCG ACCCCGGCCA GTACCACGTG GAGTACGGCT ACTCCTGCAT GGGATACGAG
ATCGCCGGCG GGCTCGGGGT GAAGCTGGCC GCGCCGGAGC GAGAGGTGTT CGTGCTGGTC
GGCGACGGCT CCTACCTGAT GATGGCCCAG GAGATCGCCA CCGCCGTGCA GGAGGGCGTC
AAACTCGTCG TGGTGCTGGT CGACAACCAC GGCTTCGCCT CCATCGGCAA CCTCAGCGAA
TCCGTCGGCG CCCAGCGGCT CGGCACCTCC TACCGGATGC GCGGCCCCTC GGGCGAGCTC
GACGGGGCCT TCCTCCCGGT GGACCTGGCC GCCAACGCCG CCAGCCTGGG CGCCGACGTG
CTGACGGCGA ACGACCCCGG CACGCTGCGG ACCGCACTGG CCAAGGCCAT GGCGTCCACG
CGCACCACGG TCGTCCACGT CGAGACCGTC CCCGGCCCAA GTCCCGAGAC CACGGCCTGG
TGGGACGTGC CGGTGGCCGA GGTGTCGAGC CTGCCCGAGG TCAGGACCGT GCGCCGGCAC
TACGAAGACC ACAAACGCGA CCAGCGGCCC TACCTCTGA
 
Protein sequence
MRLTVAQALV RFLAQQWTER DGAEQRLIAG CLGIFGHGNV AGIGQALAEQ GAGTHAALPY 
YLARNEQAMV HTAVGYARTR DRLTTFACTT SIGPGATNLV TGAALATINR IPVLLLPGDV
FATRAASPVL QELEDPRSYD VTVNDTLRPV SRFFDRINRP EQLPSALLAA MRVLTDPAET
GAVTLALPQD VQAEAYDWPE ELFRRRVWHV ARPVPEPAAL ARAQDLLRRS RRPLIVAGGG
VKHSQASHQL AAFAARHRIP VTETQAGKGA VPYDHPYAAG AIGHTGSAAA NTLAREADLV
IGIGTRYSDF TTASRTLFAG AAFLNINITA FDAAKHSGQM LVADARQALD ALEPGDWNAD
PAWSARATEL TRDWQAEIER AYGGTELTQP VMLGIVNQAA EGGVVVNAAG SMPGDLHKLW
RATDPGQYHV EYGYSCMGYE IAGGLGVKLA APEREVFVLV GDGSYLMMAQ EIATAVQEGV
KLVVVLVDNH GFASIGNLSE SVGAQRLGTS YRMRGPSGEL DGAFLPVDLA ANAASLGADV
LTANDPGTLR TALAKAMAST RTTVVHVETV PGPSPETTAW WDVPVAEVSS LPEVRTVRRH
YEDHKRDQRP YL