Gene Noc_1435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1435 
Symbol 
ID3706043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1591249 
End bp1592325 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content59% 
IMG OID637737925 
Productpyruvate dehydrogenase (lipoamide) 
Protein accessionYP_343454 
Protein GI77164929 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTAAGG TCGCCTATTT CGAAGTCCAT TACACTCAAT GTCTTAATGA AAACGCCGAA 
GTTATCGGCG TCCTTCCCTC TTTCGCCCAC GACCCCCAGA ACCTGATCCC CCTCTATCGG
ACCATGACCC TGACCCGCCT CTTTGACAAG AAAGCGGTCT CATTACAACG GACGGGGCAA
CTCGGAACCT ACGCTTCCTC ACTGGGACAA GAGGCCATCT CCGTGGCCAT TGGCCATGTT
ATGAGCGCCG ATGATGTGCT CCTAACGACC TATCGGGAAT ACGGCGCCCA ATTGCAGCGG
GGCGTCACCA TGACCGAGCT TCTCCTTTAT TGGGGCGGCG ATGAACGGGG CATGGCCTAC
CAGGACTGCC GCCATGATTT CCCGATTTCC GTTCCGGTGG CAAGCCAAGT GCCCCATGCG
GTGGGAGTCG CCTATGCCAT GAAACTACGG CGCGAGCCGC GGGTGGCGGT ATGCGTACTA
GGGGATGGGG CTACCTCGAA AGGAGATTTC TACGAAGCCA TGAATGCCGC TGGCCTCTGG
CGGCTGCCGG TCGTATTTGT CATTAACAAT AATGGTTGGG CCATCTCGGT CCCCTTGGCG
GCCCAAACCC GGACCCAGAC CCTGGCCCAA AAAGCCATTG CCGCTGGTAT TCCCGGCGAG
CAGGTGGATG GCAATGATGT CATCGCCTTG CGGACGCGCA TGGAAAACGC CATCGAAAAA
GCCCGCCGTG GCGAGGGACC CTGCCTCATT GAAGCGCTGA CTTACCGCCT TTGTGATCAC
ACCACCGCCG ATGATGCCAG CCGCTACCGG GAACAGGCAG AGGTGGAGGC CCGCTGGCGC
CTCGATCCCA TTCAACGGCT GCGGACTTAT CTCACGCAAG CCGGCGCCTG GGACGAGGAA
CAGGAACAAA GCCTCCAAAC GGAACTGACC CAGCAGGTAG AGGAAGCTGT CCAAAAATAT
TTGGACACCC CCCCTCAACC CCCGGAAAGC ATGTTTGATG ATCTCTACGA GAGCTTACCC
TCTGCCCCGC GGGAGCAGCG TCAGACAGCT ATCGCCAGGG GGAAATCCCA TGCCTGA
 
Protein sequence
MSKVAYFEVH YTQCLNENAE VIGVLPSFAH DPQNLIPLYR TMTLTRLFDK KAVSLQRTGQ 
LGTYASSLGQ EAISVAIGHV MSADDVLLTT YREYGAQLQR GVTMTELLLY WGGDERGMAY
QDCRHDFPIS VPVASQVPHA VGVAYAMKLR REPRVAVCVL GDGATSKGDF YEAMNAAGLW
RLPVVFVINN NGWAISVPLA AQTRTQTLAQ KAIAAGIPGE QVDGNDVIAL RTRMENAIEK
ARRGEGPCLI EALTYRLCDH TTADDASRYR EQAEVEARWR LDPIQRLRTY LTQAGAWDEE
QEQSLQTELT QQVEEAVQKY LDTPPQPPES MFDDLYESLP SAPREQRQTA IARGKSHA