Gene Information |
Locus tag | Nham_2976 |
Symbol | |
ID | 4033126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 3275510 |
End bp | 3277000 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637971414 |
Product | sulfatase |
Protein accession | YP_578196 |
Protein GI | 92118467 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
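The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 3275510
END_BP = 3277000
GENE_LENGTH_BP = 1491
PROTEIN_LENGTH_AA = 496


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming both ends are inclusive."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1491
    print(expected_protein_length(GENE_LENGTH_BP))  # 496
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.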
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCCG ATGAAAGTGG CCCGAACCGA CGAGACGTCC TGCTGGCGGG CACGGCTCTC GCGGCGGCTT CGGCAATTGC GGCGGACGGC ACGACGAGCG CCGCGCAGGC GCAACAGCAG CCTGCGCGCA CATCGAACGG GCCACCGAAT GTCGTCTATT TTCTCGTCGA CAATCTCGGC TATGGTGAGT TGGGCTGCTA CGGCGGCGGA ATCCTTCGGG GCGCGGATAC GCGTCGGATC GATGCGTTCG CAGACGAAGG CATCAAGCTG CTCAATTTCG CACCCGAGGC GCAGTGCACG CCTTCCCGAT CGGCTTTGAT GACGGGCCGG TACGCGATCC GCTCCGGCAA TCACACGGTT GCCCTTCCGG GCGAAGAAGG CGGCCTGGTC GCCTGGGAGC GGACGATGGG GGACGTGCTG TCGGCGCGGG GTTATGCCAC CGCCTGCGTT GGCAAGTGGC ATGTCGGCGA ATCGGCCGGA CGCTGGCCAA CCGATCACGG TTTCGACGAA TGGTACGGCC CGCCACGCTC CTGGGATGAA TCCCTTTGGC CAACGGACCC CTGGTACGAT CCCAAACGCG ATCCCGTCAG CAACATGCTG GAGTCCCGTA AGGGCGATCG GACACCGCGA ACCGTCAAGC AACTCGATCT CAACGTGCGC CGCGATGTCG ACCGCGAACT CCTGACGCGC GGCAAGGCCT TCATGAAGCG GAGCGTCGAT GCAAAACGAT CGTTCTTCCT CTATTTCAAT CACTCCCTGA TGCACATGCC GACGATTCCG CGCGCCGAGT TCAGGGGCAA GTCCGGTCAG GGCGACTGGG CGGATTGTCT GCTCGAGCTT GACTCGGATT TCGGCGAGAT CCTGGACACG CTGAAGGAAC TCAAGGTCGA CGACAACACC ATCGTCGTAT TCTCAGGAGA CAACGGGCCT GAGGAGCTGG AGCCCTGGCG CGGTACCCCC GGCTTTTTCG ATGGCTCCTA CTTCACCGGC ATGGAAGGCT CGTTGCGCAC GCCCTGCATG GTGCGCTATC CCGGCCGGGT GCCTCCAGGT AAGCAGAGCA ATGATATCGT CCACATCACC GACATGTTCA CGATAATTCT GCAATGGGCC GGTGCCGCGA TGCCGACGGA CCGCGTGATC GACGGCATCG ATCAGCGCGC CTTTTTCGAA GGAAAGCAGA ACAATTCAGC GCGCGACGGC ATCCCGTACT GGATGGCCGA CACGCTTTAC GGCGTGAAGT GGCGCAACTT CAAGATGGTA TTCTATCTCC AGAAGACGCT CACTGAGCCG GCGTTGAAAC TGTCCACGCC GCACATCATC AACCTGACCG TCGATCCCAA GGAACGCAAG GCGTTCGATC TGCCTTACAT TCATTCGTGG ACGGCCGCCC ACTTCGGTAG GATCATGAAG GACTTTGCGA TCAGCGTGAA GCGCGAACCG CTGATCCCGG CCGGAGCGCC TTTGGACTAT GTTCCTGTTC GGAAAACCTA G
|
Protein sequence | MKADESGPNR RDVLLAGTAL AAASAIAADG TTSAAQAQQQ PARTSNGPPN VVYFLVDNLG YGELGCYGGG ILRGADTRRI DAFADEGIKL LNFAPEAQCT PSRSALMTGR YAIRSGNHTV ALPGEEGGLV AWERTMGDVL SARGYATACV GKWHVGESAG RWPTDHGFDE WYGPPRSWDE SLWPTDPWYD PKRDPVSNML ESRKGDRTPR TVKQLDLNVR RDVDRELLTR GKAFMKRSVD AKRSFFLYFN HSLMHMPTIP RAEFRGKSGQ GDWADCLLEL DSDFGEILDT LKELKVDDNT IVVFSGDNGP EELEPWRGTP GFFDGSYFTG MEGSLRTPCM VRYPGRVPPG KQSNDIVHIT DMFTIILQWA GAAMPTDRVI DGIDQRAFFE GKQNNSARDG IPYWMADTLY GVKWRNFKMV FYLQKTLTEP ALKLSTPHII NLTVDPKERK AFDLPYIHSW TAAHFGRIMK DFAISVKREP LIPAGAPLDY VPVRKT
|
| |
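The sequences above are displayed in blocks of ten characters, so the spaces have to be removed before any computation. The following is a minimal sketch of how the gene sequence could be checked against the GC content and protein sequence fields. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand), and it uses the standard amino acid assignments, which translation table 11 shares. The special handling table 11 gives to alternative start codons is not modeled here, and all function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    print(translate("ATGAAAGCCG ATGAAAGTGG CCCGAACCGA"))  # MKADESGPNR
```

Passing the full Gene sequence field to `translate` and `gc_percent` would allow a comparison with the Protein sequence and GC content fields.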