Gene Nham_3107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_3107 
Symbol 
ID4029896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp3427833 
End bp3428933 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content69% 
IMG OID637971522 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_578304 
Protein GI92118575 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAGT TCACATTGCC CGATCTGGGC GAGGGCCTCG AGGAGGCCGA AGTCGTCGCC 
TGGCACGTCA ATGAGGGCGA CCATATTGTG ACCGATCAGC CGCTGCTGTC GGTGGAGACC
GACAAGGCGG TGGTCGAAGT GCCGTCGCCG TGGAGCGGAC GCATCGCGCG GCTGTGCGCG
GAGAAGGGCG ATCTGGTCAA GGTCGGCGCG CCGCTGGTGG AATTCGCCGC CGACGCCGAG
CGGGACACCG GTACGGTGGT CGGCCAGCTT GAGAGCAGCG AGGAACGCGA CGCGAAGGCC
CCAAAGGTCG CACCGGCGCG GCGAGGCACG GCACAGGCCG CGCCGGCGGT CCGTGCGCTC
GCCCAAAAAC TCGATGTCGA TCTCAACGCG GTGCAGCCGA CCGGCCCCGA TAACACCATT
ACGCGTGCGG ATGTCGAACG CGCCGCGCGC AGCCTCGCCG AGGCTGGACC GGCGCAGGTG
CTGCGCGGAA TGCGGCGCGC GATGGCGCAG CGCATGACCG CCGCACACGC CGAAGTCGTT
CCCGCCACCG TCACGGACGA CGCCGACATC GAGGAGTGGC GCAAGGACGA AGACGCCACG
ATCCGCCTGA TGCGGGCTAT CGCAGCAGCG TGCAAAGCCG AACCCGCGCT CAATACATGG
TACGATTCCC GTGCCGGCGA GCGTCGTCCG ATCACGCGCG TCGATATCGG AATCGCGATC
GACACCGAAG GTGGCCTGAT CGTGCCGATC GTGCGCAACG TCGCCGCGCG CGATGCACAT
GACTTGCGCG CCGGGCTCGA CCGGCTGCGC ACCGATGCGG CCGCGCGACG AATTCCGCCG
GAGGAATTGC GCGGCGCCAC CATCACGTTG TCGAATTTCG GCATGATCGG CGGCCGCTTC
GCGAATCTCG TTGTGGTGCC GCCGCAGGTG GCCATTGTCG GCGCCGGACG CATCGTCCAG
CGCGTGGTGG CGCATCACGG CCAGCCGGCG GTGCGCCGCG TGCTGCCGTT GTCGCTTTCG
TTCGACCATC GCGTGGTGAC CGGGGGCGAG GCCACGCGCT TCCTGATGGC GCTGAAGGCG
GATATCGAGC GCTCCGCATA G
 
Protein sequence
MQQFTLPDLG EGLEEAEVVA WHVNEGDHIV TDQPLLSVET DKAVVEVPSP WSGRIARLCA 
EKGDLVKVGA PLVEFAADAE RDTGTVVGQL ESSEERDAKA PKVAPARRGT AQAAPAVRAL
AQKLDVDLNA VQPTGPDNTI TRADVERAAR SLAEAGPAQV LRGMRRAMAQ RMTAAHAEVV
PATVTDDADI EEWRKDEDAT IRLMRAIAAA CKAEPALNTW YDSRAGERRP ITRVDIGIAI
DTEGGLIVPI VRNVAARDAH DLRAGLDRLR TDAAARRIPP EELRGATITL SNFGMIGGRF
ANLVVVPPQV AIVGAGRIVQ RVVAHHGQPA VRRVLPLSLS FDHRVVTGGE ATRFLMALKA
DIERSA