Gene Noc_2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2109 
Symbol 
ID3704419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2425288 
End bp2427996 
Gene Length2709 bp 
Protein Length902 aa 
Translation table11 
GC content55% 
IMG OID637738584 
Productpyruvate/2-oxoglutarate dehydrogenase complex dihydrolipoamide dehydrogenase (E3) component and related enzyme 
Protein accessionYP_344099 
Protein GI77165574 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes
[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.73263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAGT CCTACGTCAT TAAGATGCCT CAGCTCTCGG ATACCATGAC CGAGGGCGTA 
CTGGTTTCTT GGGAAAAAGA GATTGGCGAA TTTATCGAGC GTGGTACGGT GGTAGCAACG
GTGGAAACGG ATAAAGCCAT CATGGATGTG GAAGTATTCC GCGAGGGTTA TCTTTCTGGG
CCTCAGTTAC CCGTGGATGG AGTCGTTGCT GTCGGTGAGC CTATCGCTTA TTTAGTAGCG
GAGGCGGAAC AAGTTGTATC CACCGAGGCT GATTCCAGCC CTAAGCCAGC GCCCGAGGTT
GATGAGCCGC CCAAGTTTGA GCCTGCCGGC GCTTCTAAAC CCAAAACCAG GATTCCGGCT
ATGCCGGAAG GTGCAACTCC TGCGCCCCAT CCAAGTCATA CCCGTGCTAC TCCCTATGCG
AGGCAGCTTG CTGGTGCCCA TGGCATTGAC CTGGCAGGAG TTAAGGGCAG TGGTTCGGCG
GATGTTATCG TGGCGGCTGA TGTGGTGAGT GGGGAGGGCG CCAAGGGGAT GACCCGGCGT
ATCTTTAAGC TTCCAGGTGC GGGCCGGCCC ATGGACAGTA TGGAAAAGGC GATTGCCCAT
AATATGGAGT ACTCCCTTTC CATGCCCCTG TTTCGAGCGA CGGTTCACGT GGATCCCTCC
CGACTCGTAG CGGCAGCTAA AAAACAGGGG AGCTCGGTGA CCGTTGCTTT GGCTAAGGCA
ACGGCTTTAG CCATTGAAGA ACACCCTAAG ATCAATAGCG TTTACCAGCA TGAGGATCGA
ATTCTGGAGC GGGAGCAGAT CGATGTGGGG TTGGCGGTGG CAACGGAAGG CATGGGCCTG
GTGGTGCCGG TACTGCGGGA TACATCTCAT CGTGATATCG CCGATCTGAG CGCTGCTTGG
ATCGATCTGG TGGAACGGGC ACGGATTAAA CGGTTAAAAC CGGAAGAGTA TTCCAATCCG
ACCTTTGTGA TTTCCAATAT GGGAATGTTA GGGGTGGCTT ATTTTGATGC CATCCCCTCG
CCGGGAACCT CCGCCATTTT AGCGATTGCG ACCACCGGCC CTCAGGGAAT GCCCGTAACC
ATCACTGCGG ATCATCGAAT TGTCAACGGT GCCGATGCTG CCCGTTTTCT GAATACCTTC
AAGGAGCGAG TAGAGCACCC GGAAACCTGG ATCAGCGGTA GTGGCGGTGA CGGTACCGTG
CCAACCCCCA AGAAATCGCC AAAAGCTCCC CTGCCCTTGG AGGGAGACTG GGATTATGAT
GTGGTAGTGA TTGGCGCAGG CCCTGGAGGA GAGGATTGCG CTCGGGAATT GGTGGAACAC
GGTCTTAAAG TTGCTCTTAT TAATGATTCT CCTTTGCCGG GAGGTGAGTG TTTGTGGCGG
GGCTGCATTC CTTCCAAAAC TTGGCGCGCA GCAGCTGATC GCATCCGGGA CCGAGTCCAT
GATGCCCGGC TAGGAGTGGA AGGTACGGCC CCCACAGCGC TTAGTTGGAA AACGCTGGAG
GCTACTCGCC GCCAACTGCT CCAATCTCGG GGCGAGATGG CGCTAAAAGC TGATAAGGGG
ATGAAAATCA AATTCATCCA GGGCCATGCC CGTTTTGCCG ATGAGCATCA CGTTGAGGTT
GTTACCGCAG GCAATAGTGA TGATCCTTTT AGCCGTACTC AACCTGGATC GAATTCTCCA
AGCCAAAAAA TAAGTTTTGC CGGAGCCGTG ATCGCCACCG GCGCGCCACC TTTTATTCCG
CCTATTCCTG GCGCACAGGA AGGCTTGAGG GAAGGAGGAG TGCTTACCTC CGACACAGTT
TGGGGCCTGG AACAGATCCC TAAACGCTTG GCGGTGATTG GCGGCGGGGC CATTGGCGTG
GAGATGGCAC AAATTTTCCA GGATTTTGGT TCTGAAGTCC TATTACTGGA AGCCCAGGAT
CGGTTGCTGG CCGAGGTGGA GCCAGAAGTG GGTAAATTGC TGGCAGGCGT GCTCAATGCC
GATCCACGAC TTACGGTGCA GACCTCTACC AAGGTCCAGG CTATCAGCGG CCAACCAGGA
GCTATGGAAG TTAGCTTTGA TGATGGGGAG GGCGCTAGCC ATCGGCTGGA AGTGGACCAT
GTCATTATGG CGACTGGGAA ACGACCCCAT CTAGAGCCCT TGGCCTTGGA CCAGGCGGGT
GTGGCTACGG AAAATGGTGC CATTCGAGTG GACGCCCAAT GTACGACTTC CAAACCTCAT
ATTTTTGCAG TGGGCGATGT TATTGGCGGT TTGATGTTAG CTCATACCGC AGCGCAGCAA
GGGCGGGTAG CAGCAGCTAC CATTTTAGGC GAGGCCCATG CCTACGAGTT GGAAAAGGAT
TGCGGGGTAA TTTTTACCCG TCCCCAAGCG GCTTTCGTGG GCCTATCGCT GGTCCAGGCC
AAGGAAAGGG GAATAGATGC AGCGGAAGTG AAAATGCCTA TCCGCATTGA TGCCAAGGCG
ATGATCAGTA ATGAAACCGA GGGCCTCATT AAAATAGTGG CAGATAAAGA TAGTCACCGT
ATTATCGGGG TCCATTTCTT GGCAGATCAC GCCGATACCT TGATTGGGGA AGCCGTCATG
ATGGTGGCGG GGAACATGAC TTTGGAGCAG GTCGCCCGCG CTATTCATCC CCATCCGACC
CAGACCGAGA TGTTTGGAGA AATGGCACGG CGTTTACTCT CGCGCTTGCG CCGTACCCAA
CGGCGATAG
 
Protein sequence
MTESYVIKMP QLSDTMTEGV LVSWEKEIGE FIERGTVVAT VETDKAIMDV EVFREGYLSG 
PQLPVDGVVA VGEPIAYLVA EAEQVVSTEA DSSPKPAPEV DEPPKFEPAG ASKPKTRIPA
MPEGATPAPH PSHTRATPYA RQLAGAHGID LAGVKGSGSA DVIVAADVVS GEGAKGMTRR
IFKLPGAGRP MDSMEKAIAH NMEYSLSMPL FRATVHVDPS RLVAAAKKQG SSVTVALAKA
TALAIEEHPK INSVYQHEDR ILEREQIDVG LAVATEGMGL VVPVLRDTSH RDIADLSAAW
IDLVERARIK RLKPEEYSNP TFVISNMGML GVAYFDAIPS PGTSAILAIA TTGPQGMPVT
ITADHRIVNG ADAARFLNTF KERVEHPETW ISGSGGDGTV PTPKKSPKAP LPLEGDWDYD
VVVIGAGPGG EDCARELVEH GLKVALINDS PLPGGECLWR GCIPSKTWRA AADRIRDRVH
DARLGVEGTA PTALSWKTLE ATRRQLLQSR GEMALKADKG MKIKFIQGHA RFADEHHVEV
VTAGNSDDPF SRTQPGSNSP SQKISFAGAV IATGAPPFIP PIPGAQEGLR EGGVLTSDTV
WGLEQIPKRL AVIGGGAIGV EMAQIFQDFG SEVLLLEAQD RLLAEVEPEV GKLLAGVLNA
DPRLTVQTST KVQAISGQPG AMEVSFDDGE GASHRLEVDH VIMATGKRPH LEPLALDQAG
VATENGAIRV DAQCTTSKPH IFAVGDVIGG LMLAHTAAQQ GRVAAATILG EAHAYELEKD
CGVIFTRPQA AFVGLSLVQA KERGIDAAEV KMPIRIDAKA MISNETEGLI KIVADKDSHR
IIGVHFLADH ADTLIGEAVM MVAGNMTLEQ VARAIHPHPT QTEMFGEMAR RLLSRLRRTQ
RR