Gene Nham_2976 details

Gene Information

Locus tag: Nham_2976
Symbol:
ID: 4033126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter hamburgensis X14
Kingdom: Bacteria
Replicon accession: NC_007964
Strand:
Start bp: 3275510
End bp: 3277000
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 61%
IMG OID: 637971414
Product: sulfatase
Protein accession: YP_578196
Protein GI: 92118467
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
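
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


length = gene_length(3275510, 3277000)
print(length)                  # 1491
print(protein_length(length))  # 496
```

Both values match the Gene Length and Protein Length fields of the record.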


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCCG ATGAAAGTGG CCCGAACCGA CGAGACGTCC TGCTGGCGGG CACGGCTCTC 
GCGGCGGCTT CGGCAATTGC GGCGGACGGC ACGACGAGCG CCGCGCAGGC GCAACAGCAG
CCTGCGCGCA CATCGAACGG GCCACCGAAT GTCGTCTATT TTCTCGTCGA CAATCTCGGC
TATGGTGAGT TGGGCTGCTA CGGCGGCGGA ATCCTTCGGG GCGCGGATAC GCGTCGGATC
GATGCGTTCG CAGACGAAGG CATCAAGCTG CTCAATTTCG CACCCGAGGC GCAGTGCACG
CCTTCCCGAT CGGCTTTGAT GACGGGCCGG TACGCGATCC GCTCCGGCAA TCACACGGTT
GCCCTTCCGG GCGAAGAAGG CGGCCTGGTC GCCTGGGAGC GGACGATGGG GGACGTGCTG
TCGGCGCGGG GTTATGCCAC CGCCTGCGTT GGCAAGTGGC ATGTCGGCGA ATCGGCCGGA
CGCTGGCCAA CCGATCACGG TTTCGACGAA TGGTACGGCC CGCCACGCTC CTGGGATGAA
TCCCTTTGGC CAACGGACCC CTGGTACGAT CCCAAACGCG ATCCCGTCAG CAACATGCTG
GAGTCCCGTA AGGGCGATCG GACACCGCGA ACCGTCAAGC AACTCGATCT CAACGTGCGC
CGCGATGTCG ACCGCGAACT CCTGACGCGC GGCAAGGCCT TCATGAAGCG GAGCGTCGAT
GCAAAACGAT CGTTCTTCCT CTATTTCAAT CACTCCCTGA TGCACATGCC GACGATTCCG
CGCGCCGAGT TCAGGGGCAA GTCCGGTCAG GGCGACTGGG CGGATTGTCT GCTCGAGCTT
GACTCGGATT TCGGCGAGAT CCTGGACACG CTGAAGGAAC TCAAGGTCGA CGACAACACC
ATCGTCGTAT TCTCAGGAGA CAACGGGCCT GAGGAGCTGG AGCCCTGGCG CGGTACCCCC
GGCTTTTTCG ATGGCTCCTA CTTCACCGGC ATGGAAGGCT CGTTGCGCAC GCCCTGCATG
GTGCGCTATC CCGGCCGGGT GCCTCCAGGT AAGCAGAGCA ATGATATCGT CCACATCACC
GACATGTTCA CGATAATTCT GCAATGGGCC GGTGCCGCGA TGCCGACGGA CCGCGTGATC
GACGGCATCG ATCAGCGCGC CTTTTTCGAA GGAAAGCAGA ACAATTCAGC GCGCGACGGC
ATCCCGTACT GGATGGCCGA CACGCTTTAC GGCGTGAAGT GGCGCAACTT CAAGATGGTA
TTCTATCTCC AGAAGACGCT CACTGAGCCG GCGTTGAAAC TGTCCACGCC GCACATCATC
AACCTGACCG TCGATCCCAA GGAACGCAAG GCGTTCGATC TGCCTTACAT TCATTCGTGG
ACGGCCGCCC ACTTCGGTAG GATCATGAAG GACTTTGCGA TCAGCGTGAA GCGCGAACCG
CTGATCCCGG CCGGAGCGCC TTTGGACTAT GTTCCTGTTC GGAAAACCTA G
 
Protein sequence
MKADESGPNR RDVLLAGTAL AAASAIAADG TTSAAQAQQQ PARTSNGPPN VVYFLVDNLG 
YGELGCYGGG ILRGADTRRI DAFADEGIKL LNFAPEAQCT PSRSALMTGR YAIRSGNHTV
ALPGEEGGLV AWERTMGDVL SARGYATACV GKWHVGESAG RWPTDHGFDE WYGPPRSWDE
SLWPTDPWYD PKRDPVSNML ESRKGDRTPR TVKQLDLNVR RDVDRELLTR GKAFMKRSVD
AKRSFFLYFN HSLMHMPTIP RAEFRGKSGQ GDWADCLLEL DSDFGEILDT LKELKVDDNT
IVVFSGDNGP EELEPWRGTP GFFDGSYFTG MEGSLRTPCM VRYPGRVPPG KQSNDIVHIT
DMFTIILQWA GAAMPTDRVI DGIDQRAFFE GKQNNSARDG IPYWMADTLY GVKWRNFKMV
FYLQKTLTEP ALKLSTPHII NLTVDPKERK AFDLPYIHSW TAAHFGRIMK DFAISVKREP
LIPAGAPLDY VPVRKT
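
As a rough illustration of how the gene and protein sequences relate, the following sketch translates the first row of the gene sequence and computes a GC fraction. It rests on a few assumptions: translation table 11 assigns the same amino acids to codons as the standard code (it differs in which codons may act as start codons, which does not matter here because the gene begins with ATG), and the helper names `clean`, `translate`, and `gc_content` are hypothetical, written only for this example.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; table 11 shares these assignments
# with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used to group the sequence into 10-mers."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence from this record.
FIRST_ROW = "ATGAAAGCCG ATGAAAGTGG CCCGAACCGA CGAGACGTCC TGCTGGCGGG CACGGCTCTC"
print(translate(FIRST_ROW))  # MKADESGPNRRDVLLAGTAL
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` would give the figure to compare against the GC content field.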