Gene Nham_0921 details

Gene Information

Locus tag: Nham_0921
Symbol:
ID: 4031915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 1017473
End bp: 1018819
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 59%
IMG OID: 637969429
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_576239
Protein GI: 92116510
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
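
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(1017473, 1018819)
print(length)                     # 1347
print(protein_length_aa(length))  # 448
```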


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACCA GTTTCGATCC TGTCGCCGCC AATTCCAACA GTGCCAACGT GACGCCCGGC 
TATATGTCAG GCTTTGGCAA TAGCTTCGAG ACCGAAGCTC TGCCGGGCGC CTTGCCGATC
GGGCGTAATT CACCTCAACG CTGCGCCTAC GGATTGTACG CGGAACAATT GTCCGGCTCG
CCGTTCACCG CGCCGCGCGG CGCCAACGAA CGCTCGTGGC TTTATCGGAT CCGCCCCTCG
GTTCGTCATT CCGGGCATTT CGTTAAAATC GACGCAAAAC TCTGGCGCAC GGCGCCGTGC
CTCGAGCACG ACATGCCGCT CGCGCAACTA CGCTGGGACC CCACGCCGAT TCCAGAGGAC
GAACTGACTT TCGTGCAGAG CGTGCGAACG ATGACCACCG CCGGCGACGC CCACACCCAG
ACCGGCATGG CCGCGCATAT CTATCTCATT ACGAGATCCA TGGTCGATCA GCATTTTTAC
AATGCCGATG GCGAGATGCT GTTCGTACCG CAGGGGGGCA GCCTTCGCTT CGTCACCGAA
TTCGGCGTGA TCGACACCGA GCCGGGCGAA ATTGCCGTTA TTCCGCGCGG CGTCAAGTTT
CGCGTCGAAA TTCCATCCGG CCCCGCGCGC GGTTATCTGT GCGAGAACTA CGGCGGCGCA
TTCACACTGC CTGAACGGGG CCCCATCGGC GCAAATTGTC TTGCCAATTC ACGCGACTTT
CTCACCCCTG TTGCCGCTTA TGAGGATGAT GACAAGCCAA CCGAGTTGTT CGTGAAATGG
GGTGGAGCGT TATGGTCCAC GGCTCTGCCT CACTCGCCGA TTGATGTCGT GGCATGGCAC
GGCAATTATG CGCCCTATAA GTACGATTTA AGGACGTTCT CACCAATCGG TGCAATCGGA
TTCGATCACC CCGATCCATC GATCTTCACC GTGCTGACGT CGCCTTCAGA AATCGCCGGC
ACCGCGAATA TCGATTTCGT GATTTTCCCG GAGCGATGGG TGGTCGCAGA GAACACGTTC
CGTCCACCTT GGTATCATAT GAACGTCATG TCCGAGTTTA TGGGCTTGAT TTACGGCGTT
TACGACGCCA AGCCGCAGGG CTTCGTACCT GGCGGCATTA GCCTGCATAA TTGCATGCTT
CCTCACGGGC CCGACCGCGA GGCGTTCGAC CATGCCAGCA ATACCGAACT CAAGCCCGTC
AAGCTTACCG GCACGTTGGC CTTCATGTTC GAAACACGCT TCCCGCAGCG CGTGACTGAA
TACGCCGCAA CATCCGACGC GCTGCAGGAC GACTACGCGG ATTGTTGGCA AGGGCTTGAG
CGCCGTTTCG ATCCGACCAG GCCGTAA
 
Protein sequence
MNTSFDPVAA NSNSANVTPG YMSGFGNSFE TEALPGALPI GRNSPQRCAY GLYAEQLSGS 
PFTAPRGANE RSWLYRIRPS VRHSGHFVKI DAKLWRTAPC LEHDMPLAQL RWDPTPIPED
ELTFVQSVRT MTTAGDAHTQ TGMAAHIYLI TRSMVDQHFY NADGEMLFVP QGGSLRFVTE
FGVIDTEPGE IAVIPRGVKF RVEIPSGPAR GYLCENYGGA FTLPERGPIG ANCLANSRDF
LTPVAAYEDD DKPTELFVKW GGALWSTALP HSPIDVVAWH GNYAPYKYDL RTFSPIGAIG
FDHPDPSIFT VLTSPSEIAG TANIDFVIFP ERWVVAENTF RPPWYHMNVM SEFMGLIYGV
YDAKPQGFVP GGISLHNCML PHGPDREAFD HASNTELKPV KLTGTLAFMF ETRFPQRVTE
YAATSDALQD DYADCWQGLE RRFDPTRP
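
The sequences above are printed in blocks of ten residues per group. The following is a minimal sketch, in plain Python with no external libraries, of how one might strip that layout, compute GC content, and translate the coding sequence. The helper names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical. The codon mapping is the standard one, which translation table 11 shares for amino acid assignments; alternative start codons are not handled here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
first_blocks = clean_sequence("ATGAACACCA GTTTCGATCC TGTCGCCGCC")
print(translate(first_blocks))  # MNTSFDPVAA
```

Applied to the full gene sequence, `translate` should reproduce the protein sequence listed above, and `gc_percent` gives a value to compare against the GC content reported in the gene information.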