Gene Rcas_1531 details

Gene Information

Locus tag: Rcas_1531
Symbol:
ID: 5539007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1950355
End bp: 1951554
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 61%
IMG OID: 640893669
Product: 2-oxoglutarate dehydrogenase, E2 subunit, dihydrolipoamide succinyltransferase
Protein accession: YP_001431642
Protein GI: 156741513
COG category: [C] Energy production and conversion
COG ID: [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
TIGRFAM ID: [TIGR01347] 2-oxoglutarate dehydrogenase complex dihydrolipoamide succinyltransferase (E2 component)
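
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; neither convention is stated in the record, although both are consistent with the listed values. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above.
length = gene_length(1950355, 1951554)
print(length, protein_length(length))  # prints: 1200 399
```

Passing the gene sequence from the Sequence section to gc_percent gives a value to compare against the listed GC content; the space-separated 10-base blocks can be pasted in unchanged.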


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.199399
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.0219114
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTCG AGATTAAGGT CCCAACACTG GGAGAGTCGA TCGTCGAAGC CACGGTTGGG 
GCATGGCGCA AACACGAAGG CGACCCGATC ACCGCCGGTG AGGTGCTGGT CGAACTCGAA
ACCGATAAAG TAACCGTTGA GGTGACCGCC GAAGAGTCTG GGGTTCTGAG CCACATTCTC
AAACCTGATG GCGCGATTGT GACCATGGGC GAAATCCTGG GCATTATCGC TGAAACCGCT
GAGACGCCGG TCGCGGCGCA GTCGCACGAT GGCGCATCCG GAACCAGAGT GATGGCTACT
CCGGTCGCGC GCCGGGTCGC CGAGACGCAG GGGGTGGACA TTGCCGCCAT CCCCGGCAGC
GGTCCCGGCG GTCGGGTGAC GAAAGAGGAC GTGCTCAAAC GCGAGAGGGC GCCCCGTCCT
GCGCCGATGG AACACACCCC CCCGCCGCCC GCTCCGACAT CGGCGCCTCC GCCTGTTCCT
GCGCCGGTCT CCGGCGAAGG ACGGCGTGAG GAGCGCATTC GGATGAGTCG CCGCCGCCAG
ACGATTGCCG CCCGTCTTGT TGAAGCGCAA CGCACAGCGG CAATGCTGAC GACATTCAAC
GAGATCGATA TGAGCGCCGT CATCGACCTG CGCAAGCGCC ATCGCGATCC GTTTCGCGAG
CGTCATGGCG TCGGTCTCGG TTTCATGTCG TTCTTCACGA AAGCAGTCAT CGGCGCGCTG
AAAGCCTTCC CGTTGCTCAA TGCCGAAATC CGTGGCGACG AGATCATCAT CAAACACTAT
TACGACATCG GTATTGCCGT CAGCACCGAC GAAGGGTTAG TCGTGCCGGT GTTGCGCGAC
GCCAATCGCC TCAGTTTTGC CGAGATCGAG CGCGGCATCG AAGAACTGGC GCGCCGCGCC
CGCGAGTCAA AACTCACGAT TGCCGATCTC CAGGGAGGCA CCTTCACCAT CACCAACGGC
GGTATCTTCG GCTCACTCAT GTCAACGCCA ATCCTCAACA CACCGCAGGT TGGCATCCTG
GGCATGCACA AGATTCAGGA GCGCCCGGTA GCGCTTGATG GGCAGGTCGT CATTCGCCCG
ATGATGTATG TCGCGCTTTC CTACGATCAC CGCATTATCG ACGGGCGTGA GGCGGTATCA
TTTTTGGTGC GGGTGAAAGA ACTGGTGGAA GACCCAGAGC GACTGTTGCT GGAGGGGTAA
 
Protein sequence
MAVEIKVPTL GESIVEATVG AWRKHEGDPI TAGEVLVELE TDKVTVEVTA EESGVLSHIL 
KPDGAIVTMG EILGIIAETA ETPVAAQSHD GASGTRVMAT PVARRVAETQ GVDIAAIPGS
GPGGRVTKED VLKRERAPRP APMEHTPPPP APTSAPPPVP APVSGEGRRE ERIRMSRRRQ
TIAARLVEAQ RTAAMLTTFN EIDMSAVIDL RKRHRDPFRE RHGVGLGFMS FFTKAVIGAL
KAFPLLNAEI RGDEIIIKHY YDIGIAVSTD EGLVVPVLRD ANRLSFAEIE RGIEELARRA
RESKLTIADL QGGTFTITNG GIFGSLMSTP ILNTPQVGIL GMHKIQERPV ALDGQVVIRP
MMYVALSYDH RIIDGREAVS FLVRVKELVE DPERLLLEG