Gene Noc_2111 details


Gene Information

Locus tag: Noc_2111
Symbol:
ID: 3704421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2429011
End bp: 2430030
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 53%
IMG OID: 637738586
Product: dehydrogenase, E1 component
Protein accession: YP_344101
Protein GI: 77165576
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID: [TIGR03182] pyruvate dehydrogenase E1 component, alpha subunit
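
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are inclusive, and that the protein length excludes the terminal stop codon of this unspliced CDS. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record fields (variable names are illustrative).
start_bp = 2429011
end_bp = 2430030
gene_length_bp = 1020
protein_length_aa = 339

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS is read in 3 bp codons, and the final
# stop codon does not contribute a residue to the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp)                  # 1020
print(expected_protein_length)  # 339
```

Under these assumptions the span matches the listed gene length and the codon count minus one matches the listed protein length.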


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0196685
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAA TTGATCGTAA ACGGCTACTG CGGGAGATGG TTTTCTTCCG CCGCTTTGAG 
GACCGCTCGT TTGAGGCATA CATGGAGCGT AAAGTTGGTG GCTTTCTGCA TCTCTATTCG
GGGCAGGAAG CGGTGGCAAC GGGGGTACTC GAGATGGTGC AAGCGGATCG AGGGGTCGGC
TTCGATTATG CTATTACAGG TTACCGGGAT CATATCCATG CTATCAAAGC GGGAGCACCA
GCGCGGGAAG TTATGGCAGA GCTTTATGGT AAGGAGACCG GGAGTTCCAG AGGGCGTGGG
GGGTCAATGC ATATCTTTGA CCCAAGCGTG CGTTTTATGG GGGGCTATGC CTTAGTAGGC
CAGCCCTTCC CCCTGGCGGC AGGGCTAGCC TTGGCTTGCA AGCACCAGAA AGAAGGACGG
ATCGCGGTCT GCTTCCTTGG GGATGGGGCG AATAACCAGG GTACCTTCCA TGAAACCATG
AACATGGCTT CCCTATGGAA ATTGCCGGTA TTGTTTGTAT GCGAGAATAA CTGCTATGCC
ATCGGTACGG TTATTCAACG ATCAACCGCC GTGATTGACC AATACAAGCG CCTTGAAGCT
TATAATATTC CCGCTAGCCA GCATCCTGGT CAGGATATCG AGGTGGTTAT GGAGGCAGCC
CAATCTGCCA TAGCCCATGT GCGTAGTGGT GCAGGACCTT ATTTCCTGGA ATTTCTGACT
TATCGTTACC GGGGCCATTC CATGTCGGAT GCGGGAGCCT ACCGCAGCAA GGAAGAGGTG
GCGGAGTGGA TGCAGCGGGA TCCCATTCAG ATTCTAGCCA AGCGCCTAAT CGAAGCGGGC
GAATTAACAG AGGAGGAATT CAAAGCCATG GAACAGGCGG TTCAGAGTGA GATCGACAAT
GATATCATCC AATTTGCGGA AGAGAGTCCA GAGCCAAAAG TAGCCGATCT GGCGAAGTAT
GTCCTGGAGG ATAATCCCGA TCCTCGCTGG ATTGGGCCGT TACAGGGGCA AGGAGGATAA
 
Protein sequence
MRKIDRKRLL REMVFFRRFE DRSFEAYMER KVGGFLHLYS GQEAVATGVL EMVQADRGVG 
FDYAITGYRD HIHAIKAGAP AREVMAELYG KETGSSRGRG GSMHIFDPSV RFMGGYALVG
QPFPLAAGLA LACKHQKEGR IAVCFLGDGA NNQGTFHETM NMASLWKLPV LFVCENNCYA
IGTVIQRSTA VIDQYKRLEA YNIPASQHPG QDIEVVMEAA QSAIAHVRSG AGPYFLEFLT
YRYRGHSMSD AGAYRSKEEV AEWMQRDPIQ ILAKRLIEAG ELTEEEFKAM EQAVQSEIDN
DIIQFAEESP EPKVADLAKY VLEDNPDPRW IGPLQGQGG
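
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC percentage comparable to the "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGAGAAAAA TTGATCGTAA ACGGCTACTG CGGGAGATGG TTTTCTTCCG CCGCTTTGAG"
fragment = clean_sequence(first_line)

print(len(fragment))         # 60
print(fragment[:3])          # ATG
print(gc_percent(fragment))  # GC percentage of this fragment only
```

To compare against the record's GC content, pass the full gene sequence through `clean_sequence` first. A single line is only a fragment and need not match the overall figure.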