Gene Noc_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2110 
Symbol 
ID3704420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2428029 
End bp2429009 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content53% 
IMG OID637738585 
Productpyruvate/2-oxoglutarate dehydrogenase complex dehydrogenase (E1) component 
Protein accessionYP_344100 
Protein GI77165575 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.140374 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAAC TGGCTTACTG GGAGGCACTG CGCCGTGCCC ACGATGAAGA ACTGGCCCAT 
GATCCCCTGG TTATTGCCAT GGGCGAGGAT ATTGGGGTGG CGGGCGGTAC CTATAAAGTT
ACCCTGGGCC TCTACGGCAA ATATGGGGAG GAGCGAATTA TTGATACCCC TATTTCCGAG
AATTCCTATA CCGGTATCGG AATTGGGGCC TCGATGGCCG GAATGCGGCC TATCATCGAA
ATCATGTCCA TTAATTTTGC CTTGCTGGCT CTGGATACTC TCATCAATGC GGCTGCTAAG
ATCCGTTATA TGTCGGGTGG CCGCGCTCAG TGTCCTATCG TAATGCGAAC TCCAGGGGGA
ACGGCCCACC AGCTTGCCGC TCAACATTCG GCACGGTTAT CAAGGCTCTT TATGGGAACG
CCGGGTCTGC GGGTTGTCAC GCCGAGTACC CCCTTGGATG CCTACGGCAT GCTTAAATCT
GCGGTGCGTT GTAACGATCC AGTGATCTTT CTTGAGCACG AAAGTATGTA TAACCTCAAA
GGGGAAGTGC CCGATGAGGA GACTTTTCGG CCTTTGGAAG GTGCCGGGGT CGTTCGTGAG
GGAACGGATA TTACCCTTAT AGGCTATAAC TATAGCGTGC ATTGGTGTTT AACCGCGGCG
GATAAATTGG CCCAGGAAGG CATTCATGCC GAGGTTATTG ATTTACGCTC CCTTAAACCC
ATCGACCGGG AAACCATTCG CCGCTCCATA GAAAAAACCC ACCGGGTTCT GGTGGCCGAA
GAAGATGAGG CGCCGGTGGG TGTTGGCAGT GAGGTGATCG CTGGAATCAT CGAGGATTGC
TTCTTCGCTT TAGATGCCCA GCCAGTACGG GTTCATGCAG CGGATGTTCC GGTGCCTTAC
AACTATAGCC TGGAGAAGGC TGCGATTCCT GATGCTAAGG ATGTCTACCA GAGTGCCCTT
AAGGTATTGG GAAAAGTTTA G
 
Protein sequence
MAELAYWEAL RRAHDEELAH DPLVIAMGED IGVAGGTYKV TLGLYGKYGE ERIIDTPISE 
NSYTGIGIGA SMAGMRPIIE IMSINFALLA LDTLINAAAK IRYMSGGRAQ CPIVMRTPGG
TAHQLAAQHS ARLSRLFMGT PGLRVVTPST PLDAYGMLKS AVRCNDPVIF LEHESMYNLK
GEVPDEETFR PLEGAGVVRE GTDITLIGYN YSVHWCLTAA DKLAQEGIHA EVIDLRSLKP
IDRETIRRSI EKTHRVLVAE EDEAPVGVGS EVIAGIIEDC FFALDAQPVR VHAADVPVPY
NYSLEKAAIP DAKDVYQSAL KVLGKV