Gene Rcas_2804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2804 
Symbol 
ID5540291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3625709 
End bp3627046 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content64% 
IMG OID640894931 
Productdehydrogenase catalytic domain-containing protein 
Protein accessionYP_001432893 
Protein GI156742764 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGTTG ATATCGTTCT TCCACAGATC GGCGAAAGTA TGACCGAAGC CACGATCGGG 
CGCTGGCTCA AGCGTGTCGG CGACCGCATC GAACGCTTCG AGGCATTGGT GGAAGTCGAG
ACGGATAAAG TCTCGACCGA AGTGACCTCG ATTGCCAGTG GCATTTTGCT CGAAATCGTG
ACGCCTGAAG GCGCCACAGT GCCGGTTGGC ACGCTTCTGG CGCGTATCGG CGAGACGGCG
GAGAGGCATG TGAGCGCAGC GCCAGCGCCG TCGCAGGAGA CAACGGCAGC GCCAGAACCT
GTGCGCATCC GCCGCGGCGA TGGTCCGCCG ATCACGCCGG TGGTGGCGCG TCTGGCTGCT
GAATATGGTA TCGACCTGAG CCAAATCCGT GGCACCGGCG CCGGCGGGCG CGTCAGCAAG
AAGGATGTGT TGCGCTACAT CGAGATGCAG AAAGCGGCTG CCGCTTTGCT GCCCGGCGCA
CCCACTGCGC CGCCTCCGGC GCCCGAAGCG CCTCCCATCC CATCTGTTTC CACAGCGCCA
TCACCCCCTC TAGCGCGCGA AACGCCTTCT ACTGCGCCTG TTGCCGAAGC GCCGCCTGCC
CTGCCCACAG CGCAGCGCCC TCCAATCACG CAACCGTTGC CCGACGAGGC GATCCTCACG
CCATTGACCA CGATGCGACG CATGATCGCC GATCATATGG TCCGCTCCCT GCGCGACGCC
CCGCAGGCCA CGACGGTCTT TGAGGTCGAT ATGGGGCGCG TGCTGGCGCA CCGCGACCGG
TATCGCGCCT CTTTTGAACA GCAAGGGATA CGGTTGACTC TGACAGCGTA TGTGGTTCAG
GCGGTTGCGA CTGCGCTGCG CCGCGTTCCG GCATTGAACA CGCGCTTCAC TGACGAAGGG
ATCATCACAT ACCGGCGGAT CAACATTGGG GTGGCGGTCG CCCTCGACGA CGGATTGATC
GTGCCGGTGC TGCGTGACGC CGACGAGAAA AGTCTGGCCG GCATCGCGCG CGCGTTGAAC
GACCTGACGG AGCGCGCCCG CGCGCGCCGC CTGCAACCGG ACGACACCGA AGGGGGAACG
TTTACCATCT CGAACCATGG CGTTGGCGGC AGTCTGTTCG CCACGCCGAT CCTCAACCGT
GGACAGAGCG GTATTCTTGG CGTCGGCGCC GTGGTGAAGC GCGCGGTCGT TGTGACCCAT
CAGGGGAATG ATGCGATTGT CATTCGCCCG ATGTGCTACC TGTCGTTGAC ATTCGACCAC
CGCGCCTGTG ATGGCGCGAC CGCCGACGCA TTTCTGGCAG CGGTCAAAGA GGTTCTGGAA
ACCTACCCCG AGCAATAA
 
Protein sequence
MAVDIVLPQI GESMTEATIG RWLKRVGDRI ERFEALVEVE TDKVSTEVTS IASGILLEIV 
TPEGATVPVG TLLARIGETA ERHVSAAPAP SQETTAAPEP VRIRRGDGPP ITPVVARLAA
EYGIDLSQIR GTGAGGRVSK KDVLRYIEMQ KAAAALLPGA PTAPPPAPEA PPIPSVSTAP
SPPLARETPS TAPVAEAPPA LPTAQRPPIT QPLPDEAILT PLTTMRRMIA DHMVRSLRDA
PQATTVFEVD MGRVLAHRDR YRASFEQQGI RLTLTAYVVQ AVATALRRVP ALNTRFTDEG
IITYRRINIG VAVALDDGLI VPVLRDADEK SLAGIARALN DLTERARARR LQPDDTEGGT
FTISNHGVGG SLFATPILNR GQSGILGVGA VVKRAVVVTH QGNDAIVIRP MCYLSLTFDH
RACDGATADA FLAAVKEVLE TYPEQ