Gene Information |
Locus tag | Nham_0921 |
Symbol | |
ID | 4031915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 1017473 |
End bp | 1018819 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637969429 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_576239 |
Protein GI | 92116510 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
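The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1017473
end_bp = 1018819

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (Is gene spliced = No) and ends with one stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1  # the stop codon encodes no residue

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the result agrees with the listed Gene Length (1347 bp) and Protein Length (448 aa).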
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCA GTTTCGATCC TGTCGCCGCC AATTCCAACA GTGCCAACGT GACGCCCGGC TATATGTCAG GCTTTGGCAA TAGCTTCGAG ACCGAAGCTC TGCCGGGCGC CTTGCCGATC GGGCGTAATT CACCTCAACG CTGCGCCTAC GGATTGTACG CGGAACAATT GTCCGGCTCG CCGTTCACCG CGCCGCGCGG CGCCAACGAA CGCTCGTGGC TTTATCGGAT CCGCCCCTCG GTTCGTCATT CCGGGCATTT CGTTAAAATC GACGCAAAAC TCTGGCGCAC GGCGCCGTGC CTCGAGCACG ACATGCCGCT CGCGCAACTA CGCTGGGACC CCACGCCGAT TCCAGAGGAC GAACTGACTT TCGTGCAGAG CGTGCGAACG ATGACCACCG CCGGCGACGC CCACACCCAG ACCGGCATGG CCGCGCATAT CTATCTCATT ACGAGATCCA TGGTCGATCA GCATTTTTAC AATGCCGATG GCGAGATGCT GTTCGTACCG CAGGGGGGCA GCCTTCGCTT CGTCACCGAA TTCGGCGTGA TCGACACCGA GCCGGGCGAA ATTGCCGTTA TTCCGCGCGG CGTCAAGTTT CGCGTCGAAA TTCCATCCGG CCCCGCGCGC GGTTATCTGT GCGAGAACTA CGGCGGCGCA TTCACACTGC CTGAACGGGG CCCCATCGGC GCAAATTGTC TTGCCAATTC ACGCGACTTT CTCACCCCTG TTGCCGCTTA TGAGGATGAT GACAAGCCAA CCGAGTTGTT CGTGAAATGG GGTGGAGCGT TATGGTCCAC GGCTCTGCCT CACTCGCCGA TTGATGTCGT GGCATGGCAC GGCAATTATG CGCCCTATAA GTACGATTTA AGGACGTTCT CACCAATCGG TGCAATCGGA TTCGATCACC CCGATCCATC GATCTTCACC GTGCTGACGT CGCCTTCAGA AATCGCCGGC ACCGCGAATA TCGATTTCGT GATTTTCCCG GAGCGATGGG TGGTCGCAGA GAACACGTTC CGTCCACCTT GGTATCATAT GAACGTCATG TCCGAGTTTA TGGGCTTGAT TTACGGCGTT TACGACGCCA AGCCGCAGGG CTTCGTACCT GGCGGCATTA GCCTGCATAA TTGCATGCTT CCTCACGGGC CCGACCGCGA GGCGTTCGAC CATGCCAGCA ATACCGAACT CAAGCCCGTC AAGCTTACCG GCACGTTGGC CTTCATGTTC GAAACACGCT TCCCGCAGCG CGTGACTGAA TACGCCGCAA CATCCGACGC GCTGCAGGAC GACTACGCGG ATTGTTGGCA AGGGCTTGAG CGCCGTTTCG ATCCGACCAG GCCGTAA
|
Protein sequence | MNTSFDPVAA NSNSANVTPG YMSGFGNSFE TEALPGALPI GRNSPQRCAY GLYAEQLSGS PFTAPRGANE RSWLYRIRPS VRHSGHFVKI DAKLWRTAPC LEHDMPLAQL RWDPTPIPED ELTFVQSVRT MTTAGDAHTQ TGMAAHIYLI TRSMVDQHFY NADGEMLFVP QGGSLRFVTE FGVIDTEPGE IAVIPRGVKF RVEIPSGPAR GYLCENYGGA FTLPERGPIG ANCLANSRDF LTPVAAYEDD DKPTELFVKW GGALWSTALP HSPIDVVAWH GNYAPYKYDL RTFSPIGAIG FDHPDPSIFT VLTSPSEIAG TANIDFVIFP ERWVVAENTF RPPWYHMNVM SEFMGLIYGV YDAKPQGFVP GGISLHNCML PHGPDREAFD HASNTELKPV KLTGTLAFMF ETRFPQRVTE YAATSDALQD DYADCWQGLE RRFDPTRP
|
| |
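The relationship between the Gene sequence and Protein sequence fields can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in the coding (5' to 3') orientation even though the gene lies on the minus strand, and it relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does (the two differ only in permitted start codons). The function names `translate` and `gc_fraction` are hypothetical helpers written for this example, not part of any database tool.

```python
# Codon table in the conventional TCAG ordering; amino-acid assignments are the
# same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-nt blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-nt blocks and last three blocks of the Gene sequence field.
head = "ATGAACACCA GTTTCGATCC TGTCGCCGCC"
tail = "CGCCGTTTCG ATCCGACCAG GCCGTAA"

print(translate(head))  # first ten residues
print(translate(tail))  # last eight residues, stop codon excluded
```

The two fragments translate to the first ten and last eight residues of the Protein sequence field (MNTSFDPVAA and RRFDPTRP). Passing the full Gene sequence string to `gc_fraction` is one way to compare against the listed GC content.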