Gene Noc_2778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2778 
Symbol 
ID3705508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3154833 
End bp3156134 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content55% 
IMG OID637739254 
Producthistidinol dehydrogenase 
Protein accessionYP_344755 
Protein GI77166230 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0141] Histidinol dehydrogenase 
TIGRFAM ID[TIGR00069] histidinol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGAGA TGACGCGGTT AGACATTGCC CAGGGAGATT TTTGGCTCTG TCTGGAGCAG 
CGCTTGGCTT GGGAAGGAGT CGCGGATGAA ACGGTGGTGG CTACCGTGAG GGAGATTCTC
AAGGCGATCC GTTGCCGGGG GGATGAAGCC TTGCTGGAGT ATACCCAACG TTTCGATGGC
CTAGAAATAG CCCGTGCTCC AGAGCTTGAA ATCCCAACCT CTCGTTTACA GGCGGCGTTA
ACAGCCATTT CCAGGGAGCA GCGGGAAGCG CTTCAGGTGG CGGCTGAGCG GATTACCACC
TATCACCGCC ATCAGAAGCA GGAATCTTGG AGTTATACCG AGCCGGATGG TACTTTGCTG
GGTCAGCAGG TGAGGCCCTT GGACCGGGTG GGACTCTATG TACCTGGCGG TAAGGCGGCT
TATCCTTCGT CCGTATTGAT GAATGCCCTC CCGGCAAAGG TAGCAGGCGT TTCCGAACTG
ATTATGGTGG TGCCAACGCC CAAGGGTGAA ATGAACGATC TCGTGCTAGG AGCAGCGGCT
ATTGCGGGTG TGGATCGGGT GTTTACCGTG GGTGGCGCAC AGGCAGTCGC TGCCTTGGCT
TTTGGCACTG AAAGCGTACC CCGGGTGGAT AAAATTGTCG GTCCGGGTAA TATGTATGTT
GCCACTGCTA AAAGTATGGT TTTTGGCCAA GTGGGAATCG ATATGATTGC CGGGCCTTCT
GAGATTTTGG TACTCTGCGA TGGGAAAACC GATCCGGAGT GGATTGCTAT GGATTTATTC
TCCCAGGCAG AGCATGATGA GGCAGCCCAG GCAATTTTAC TCTCCCCCGA CGGTGTTTTT
TTGGATAAAG TCACGGAGGC CATGGCGCGG TTGCTACCTA CTCTGGAGCG ACAGGAGGTT
ATTGCGAATT CCCTGCGGTC CCGCGGTACC TTGATTAAGG TAGAAAATCT GGATCAGGCT
TTGGAAGTTA TCAACTTTAT CGCTCCTGAA CACTTAGAGC TTTCGGTGGA AGATCCTCAA
GCGTTAGCCT CCCGAGTTCG CCATGCCGGA GCTATCTTTA TGGGACGCTA TACTGCTGAA
GCTATCGGCG ATTACTGTGC CGGTTCTAAC CATGTGCTGC CCACCTCACG CACCGCTCGT
TTCAGCTCGC CCTTGGGCGT GTATGACTTT CAGAAGCGCT CCAGCTTGAT TCAATGCTCC
CCCCAGGGTA GCCAAACTTT GGGGCGCGTG GCTTCGGTAT TGGCGCGCGG CGAAGGGCTA
ACAGCCCATG CCCGCTCAGC AGAATACCGC TTAAAAGGGT AA
 
Protein sequence
MVEMTRLDIA QGDFWLCLEQ RLAWEGVADE TVVATVREIL KAIRCRGDEA LLEYTQRFDG 
LEIARAPELE IPTSRLQAAL TAISREQREA LQVAAERITT YHRHQKQESW SYTEPDGTLL
GQQVRPLDRV GLYVPGGKAA YPSSVLMNAL PAKVAGVSEL IMVVPTPKGE MNDLVLGAAA
IAGVDRVFTV GGAQAVAALA FGTESVPRVD KIVGPGNMYV ATAKSMVFGQ VGIDMIAGPS
EILVLCDGKT DPEWIAMDLF SQAEHDEAAQ AILLSPDGVF LDKVTEAMAR LLPTLERQEV
IANSLRSRGT LIKVENLDQA LEVINFIAPE HLELSVEDPQ ALASRVRHAG AIFMGRYTAE
AIGDYCAGSN HVLPTSRTAR FSSPLGVYDF QKRSSLIQCS PQGSQTLGRV ASVLARGEGL
TAHARSAEYR LKG