Gene Tcr_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTcr_1002 
Symbol 
ID3760488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiomicrospira crunogena XCL-2 
KingdomBacteria 
Replicon accessionNC_007520 
Strand
Start bp1080681 
End bp1081994 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content50% 
IMG OID637785723 
Productdehydrogenase catalytic domain-containing protein 
Protein accessionYP_391271 
Protein GI78485346 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR01348] pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase, long form 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.442522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACAC AGCAAATTAA TATCCCGGAT ATCGGCGATT TTGATTCAGT AGAAGTTATT 
GAAGTTTTAG TCGCCGAAGG AGATGAAGTC GCTGTTGATG ATTCTTTACT CACGTTAGAA
TCAGATAAAG CAACGATGGA AATTCCAGCG CCTTACGCCG GAAAAATCAC CAAAGTCACT
GTTTCAGTCG GCGATAAAGT CGCGGAAGGC GATGCCGTCT TTGAAATCGA AGTGTCTGAA
GCCGCGGCTT CGGAAGAAAA GCCAGCAGAC AAACCGGCCC CTGAAAAAAC GCCTGAAGCG
CCTAAAGAAG CCCCTAAGCC AGCCGCCGAA ACAGCGCCTG CACCGGCCAC GCCTTCGCCA
ACGGCACAAG CGTTAACCAA ACCCGTCAAT GCACAATCTA TGGGCGCTGC ATCACATGCT
TCGCCATCGG TTAGAGCGTT TGCGCGTAAA CTCGGCGTAG ACATCAGCTC CGTGTCTGGA
AGCGGTCCGA AAGGACGTAT CCAGCAATCC GACATCGAAG CTATGATTAA GTCTGTCATG
CAAGGTGGCG CAGGCGCAGG TCAAGCACAA GGCGGCATGG GCATTCCATC CGTTCCTGAA
ATTGACTTCA GCCAGTTCGG TGAAACCGAA ACGGTTGAGT TGGGTCGTAT CAAGAAAATC
TCTGGTAAGT TCCTGCAAAC CAGCTGGTTG AATGTTCCGC ACGTCACACA GTTTGACGAA
TGCGACATCA CTGAAATGGA CGCCTTCCGT AAGAGCATGA AAGCGAAAGC GGAAAAAGAA
GGCGTGAAAT TGACGCCATT GGTCTTCGTG ATGAAAGCCG TGGTCAAAGC GTTGCAAGAC
TTCCCAAGTT TCAATAGCTC TTTATCACCA GATGGTCAGT CGTTGATTAA GAAACAGTAT
TACAACATCG GTGTCGCGGT CGATACGCCA AATGGCTTGG TGGTACCGGT TCTGCGTGAT
GTCGATAAAA AAGGCATCTA TGAACTGTCT CGTGAACTGA TGGAAATTTC CGGCAAAGCA
CGTGACGGTA AATTATCGCC AAAAGATATG TCTGGCGGCA CTTTCACTAT TTCAAGCCTT
GGTGGCATTG GCGGTACACA ATTCACGCCA ATCGTCAACG CTCCGGAAGT GGCTATCATG
GGGCTTTCAA AAGCGAAAAT GCAACCGGTC TGGAACGGGT CTGAATTTGA ACCTCGCTTG
GTCATGCCGT TCAGTGTGTC GTATGACCAC CGTGTGGTCG ATGGCGCGGA AGGCGTTCGC
TTTACCACCA CTGTCGGTCA GTATTTAACT GATCTACGCC AATTGATTCT GTAA
 
Protein sequence
MATQQINIPD IGDFDSVEVI EVLVAEGDEV AVDDSLLTLE SDKATMEIPA PYAGKITKVT 
VSVGDKVAEG DAVFEIEVSE AAASEEKPAD KPAPEKTPEA PKEAPKPAAE TAPAPATPSP
TAQALTKPVN AQSMGAASHA SPSVRAFARK LGVDISSVSG SGPKGRIQQS DIEAMIKSVM
QGGAGAGQAQ GGMGIPSVPE IDFSQFGETE TVELGRIKKI SGKFLQTSWL NVPHVTQFDE
CDITEMDAFR KSMKAKAEKE GVKLTPLVFV MKAVVKALQD FPSFNSSLSP DGQSLIKKQY
YNIGVAVDTP NGLVVPVLRD VDKKGIYELS RELMEISGKA RDGKLSPKDM SGGTFTISSL
GGIGGTQFTP IVNAPEVAIM GLSKAKMQPV WNGSEFEPRL VMPFSVSYDH RVVDGAEGVR
FTTTVGQYLT DLRQLIL