Gene Noc_2562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2562 
Symbol 
ID3704565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2913927 
End bp2915180 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content49% 
IMG OID637739041 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_344545 
Protein GI77166020 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGA TCCGGAATTT TACTCTTAAT TTTGGCCCGC AGCACCCAGC AGCCCATGGG 
GTTTTGCGGT TAGTGCTAGA AATGGATGGA GAGATTATCC AGCGGGCTGA TCCTCACGTG
GGTCTGTTGC ATCGGGCTAC AGAAAAACTC GCTGAGAGCA AGCCGTTCAA TCAGAGCATC
GGGTATATGG ACCGGTTAGA CTATGTGTCC ATGATGTGCA ATGAACACGG TTATGTGAAA
GCGATTGAAA CACTCCTCGG CATTGAGCCA CCCTTACGGG CACAGTATAT TCGGACGATG
TTCGATGAAA TTACCCGTAT TCTTAATCAT CTCATGTGGC TGGGTGCCCA TGGCCTAGAT
ATCGGCGCCA TGACGGTGTT TTTATACTGC TTCCGGGAGC GGGAGGATCT TATGGACTGC
TATGAAGCGG TTTCAGGAGC TCGTATGCAT GCGACTTATT ATCGTCCCGG TGGGGTTTAT
CGCGACTTGC CGGAGACGAT GCCTGGATAT CAGCCTTCTA AGTGGCATAA CGAAAAAGAG
GTGGCAATAA TTAATCGGAA TCGAGAGGGA TCTTTGCTTG ATTTTATTGA AGATTTTACC
GCCCGCTTTC CTACTTGTGT GGATGAATAT GAAACCCTGT TGACGGATAA TCGAATCTGG
AAACAGCGTA CGGTTGGCAT TGGAGTCGTC ACGCCGGAGC GTGCATTGCA ACTGGGTTTT
ACGGGACCGA TGCTGCGTGG CTCGGGAGTG GAATGGGATT TACGGAAAAA ACAACCCTAT
GCTGCGTATG ACCAAATAGA TTTTGATATT CCTGTGGGGG TTAACGGGGA CTGCTATGAC
CGTTATTTGG TACGGATAGA AGAGATGCGC CAGTCCAACC AAATTATCAA GCAGTGCGTG
GACTGGCTAC GAAAAAATCC TGGACCGGTT ATAGTGAATA ACTATAAGGT TGCTGCCCCC
CCGCGGGAGA AAATGAAAAA TGATATGGAG GTGCTGATCC ATCATTTCAA GCTATTTACC
GAAGGGTATT GCGTCCCGGA AGGTGAAGCT TATGCGGCAG TAGAAGCACC CAAGGGTGAA
TTTGGGGTTT ATCTTATCTC GGATGGGGCT AATAAACCCT ACCGGCTTAA AGTTCGAGCG
CCGGGGTTTG CCCATTTAGC CGCTATGGAC GAAATGGTAC AAGGCCACAT GCTTGCCGAT
GTGGTTGCCA TTATCGGAAC CATGGATATC GTGTTTGGGG AGATCGATCG GTGA
 
Protein sequence
MAEIRNFTLN FGPQHPAAHG VLRLVLEMDG EIIQRADPHV GLLHRATEKL AESKPFNQSI 
GYMDRLDYVS MMCNEHGYVK AIETLLGIEP PLRAQYIRTM FDEITRILNH LMWLGAHGLD
IGAMTVFLYC FREREDLMDC YEAVSGARMH ATYYRPGGVY RDLPETMPGY QPSKWHNEKE
VAIINRNREG SLLDFIEDFT ARFPTCVDEY ETLLTDNRIW KQRTVGIGVV TPERALQLGF
TGPMLRGSGV EWDLRKKQPY AAYDQIDFDI PVGVNGDCYD RYLVRIEEMR QSNQIIKQCV
DWLRKNPGPV IVNNYKVAAP PREKMKNDME VLIHHFKLFT EGYCVPEGEA YAAVEAPKGE
FGVYLISDGA NKPYRLKVRA PGFAHLAAMD EMVQGHMLAD VVAIIGTMDI VFGEIDR