Gene Rcas_3594 details


Gene Information

Locus tag: Rcas_3594
Symbol:
ID: 5541095
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4691015
End bp: 4692739
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 63%
IMG OID: 640895713
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_001433661
Protein GI: 156743532
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
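
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4691015
end_bp = 4692739
gene_length_bp = 1725
protein_length_aa = 574

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, expected_protein_aa)  # 1725 574
```

Under these assumptions, the computed span and protein length agree with the Gene Length and Protein Length fields.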


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGGTG GTGATCGCGT GGCACTGGCG CTTCAGGCGC ATGGGGTGCG TTTCGTTTTT 
ACCCTGACCG GCGGGCATAT CGCGCCCATC CTGGTTGGTT GCCGTCAACG CGGCATTCGT
GTGATCGACA CGCGCCATGA GGCGACTGCC GTGTTCGCCG CCGATGCTGT GGCGCGACTG
ACCGGCGTGC CCGGTGTCGC TGTAGTGACT GCCGGTCCTG GCGTGACGAA CACGATCACG
GCGATCAAGA ATGCGCAGAT GGCGCAATCG CCGCTGGTGC TGATCGGTGG CGCAGCGGCG
ACTGCACTGC GGGGACGCGG CGCGCTTCAG GATATTGACC AGATGGCGCT GATGCAATCA
ATCACGAAGT GGCGTCGGTC GGTTGCGCGG GTGCGGGAGA TTATTCCGGC GCTGGAGGAG
GCATTCTTTC AGGCGCGCAG CGATGTGCCC GGTCCGGTGT TCGTCGAACT GCCGGTCGAT
CTGTTGTACG ACGAACAGGT GGTGCGCCGG TGGTACGGTC TCAATCGTAG TGGACGCTCT
CCAAAGCAAT GGATTGTTCA GCGGTATCTC GACTGGCGGG TGAGTCGATT GTTTGCCGGC
GTGCAGGATC AACCGGTGGC GACGTCCCGT TTGGTCGATG TTCCTGAACC AGAAGACGGC
GCGGTGCGCG CGGCGGTGCT CCGCCTGGTG CAATCACAAC GTCCGCTCCT GTTGGTAGGA
AGTCAGACGA TGCTCGATAC GGCGTCGGTC GGTGCGTTGG CGAAGGCAGT CACCCGGCTG
GGTGTGCCGA CGTATCTCTC TGGCATGGCG CGTGGGTTGC TCGGCGCGAA TCACGCATTG
CAGTTGCGGC ACCGGCGGCG CGAGGCGCTG CGCGAAGCCG ATCTGGTCAT TCTTGCGGGT
GTGCCGTGCG ACTTCCGCCT CGACTATGGC AACCATATTG CGCGTCGCGC CACGCTTATC
GCCGCCAACC GCAGCCGCAC CGATCTGATG CTCAACCGGC GCCCCGACAT TGCGGTGTTG
GGAGACCCGG CGCGCTTTGT GTGTATGCTG GCGGAACGGG TGTTGCCCGG TGATCCCTGG
AAGGAATGGA TCGAACGGTT GCGTCAGCGC GATGCCGTCC GCGATGCTGA GATTGTGCGT
CAGGCAGCCG AACCGACATC CTATGTCAAT CCGATCCATC TCTGCCGGGT GATCAACGAG
ACGCTTTCCG ATCACAGCAT AATTGTTGCC GATGGCGGCG ACTTTGTGGC AACCGCGTCG
TACACGGTGC GTCCGCCGCG CCCGTTAAGC TGGCTCGATC CGGGACCGTT CGGCACACTC
GGCGTTGGCG CGGGATTTGC GCTCGGTGCG AAACTCTGTC AGCCTGATGC GGATGTCTGG
CTTCTGTATG GAGATGGGTC TGCCGGCTAC AGCCTGACCG AATTCGACAC TTTTGTGCGC
CACCAGGCGG CAGTGGTTGC GGTCATAGGA AATGACGCTG GCTGGACGCA AATTGCGCGT
GAGCAGGTGG AGATTCTGCA CGACGATGTG GCAACCACGC TGGCGTATAG CGACTACCAT
CGGGTGGCGG AGGGGTTCGG CGCTGCTGGC TATCGCCTCG ACGACCCGGA ACTGGCGGAG
GAGACCTTGA AGCAGGCGCG CCAGACGGCT GCCCAGGGCA GACCGGTGCT CGTCAATGCT
CTGATCGGCA AAACCGATTT TCGTAAGGGG TCGATTTCCA TGTGA
 
Protein sequence
MHGGDRVALA LQAHGVRFVF TLTGGHIAPI LVGCRQRGIR VIDTRHEATA VFAADAVARL 
TGVPGVAVVT AGPGVTNTIT AIKNAQMAQS PLVLIGGAAA TALRGRGALQ DIDQMALMQS
ITKWRRSVAR VREIIPALEE AFFQARSDVP GPVFVELPVD LLYDEQVVRR WYGLNRSGRS
PKQWIVQRYL DWRVSRLFAG VQDQPVATSR LVDVPEPEDG AVRAAVLRLV QSQRPLLLVG
SQTMLDTASV GALAKAVTRL GVPTYLSGMA RGLLGANHAL QLRHRRREAL READLVILAG
VPCDFRLDYG NHIARRATLI AANRSRTDLM LNRRPDIAVL GDPARFVCML AERVLPGDPW
KEWIERLRQR DAVRDAEIVR QAAEPTSYVN PIHLCRVINE TLSDHSIIVA DGGDFVATAS
YTVRPPRPLS WLDPGPFGTL GVGAGFALGA KLCQPDADVW LLYGDGSAGY SLTEFDTFVR
HQAAVVAVIG NDAGWTQIAR EQVEILHDDV ATTLAYSDYH RVAEGFGAAG YRLDDPELAE
ETLKQARQTA AQGRPVLVNA LIGKTDFRKG SISM
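
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the database's own tooling. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard one, whose amino acid assignments are assumed to match translation table 11 (the two differ in the permitted start codons, which does not matter here because this gene starts with ATG). Only the first and last rows of the gene sequence are used as input.

```python
from itertools import product

# Standard genetic code written in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence shown above.
first_row = "ATGCACGGTG GTGATCGCGT GGCACTGGCG CTTCAGGCGC ATGGGGTGCG TTTCGTTTTT"
last_row = "CTGATCGGCA AAACCGATTT TCGTAAGGGG TCGATTTCCA TGTGA"

print(translate(first_row))  # MHGGDRVALALQAHGVRFVF
print(translate(last_row))   # LIGKTDFRKGSISM
```

The two translated fragments match the start and end of the protein sequence in this record. Applying `gc_fraction` to the full gene sequence would give a value to compare with the GC content field.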