Gene Nham_3301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_3301 
Symbol 
ID4029379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp3644576 
End bp3646264 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content52% 
IMG OID637971713 
Productsulfatase 
Protein accessionYP_578495 
Protein GI92118766 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTGA CCTTGATAAT TGTGGCCGCC GTCACCGTCG TTTGGTTGGC AGAGCGCAGC 
GTTGGGCACC TTAAATTGGC TATTGCCGCG CTGTGTTTTA TTGCTGCGGC GCTGCTGTTC
GTCGTTACCT CTCTCGAGAG AGCCATTCTG CTATCGTCCA TCGTGGTGGC GGCAATCTTC
GGAGCGTCAA GCGTCAAATA CAATCATAGC GGGCTGAAAC TGACAGTAAC GGATTTGCCG
CTAGCGTTTG CAGGAACAGT TCCGTTTTTC ATTGTGCAAT ACCCGCTCGC AGTAATGGCG
GTTCTCGCCG GGGGCATAGG ACTCATTCTG GCCACAGTTG CGACTCTGCT CTATGTGCCC
GGGCTGCCGG TTTCTTTAGA GCTCCAGAGT ATTCTGTTTT TCTTTGCATT CATTGGTTTA
ATCGCGGCGT ACAGGGCAGA CGGCGGAGCC GAATCTCTTC ATCGCATTGC GGCTCAGCGG
CGGTGCTTTT TTTCGACCTT CATAGGCTCG CTTCTCAATC CGCTTTCCTG GCGACAGTTC
AGCGGGCTCG TTTTAAGCGA TATCGCTGAA GACTCGTTGC AGCTGATGCC GGCCATACCA
GCGCGTACTC TCGACTATCC TGACATTATA GTCATCCAAC ACGAATCGAT CTTCGATCCG
CGCGTGTTTG GACTTCCGGT TGAGCCAATT GTCGAAGCAT TTTTGTCTCC GAAAAATGGT
CTGTTTGGAA GCCTTAACGT CGATATTTTT GGCGGAGGAT CGTGGCAATC CGAATTTAGC
TTATTAACAG GCCTGTCCAG CGCAAGCTTT GGTTCAAACG CCTATTTTCT TTTTAAGAGA
GGCGTTGGGC GATTCCACAA TAGCCTTCCA AATGCGCTGA CTGCAATTGG ATATAGAACA
ATGCTCGCGT CGAGTTGCCG TCGCAGTTTT CTCCACTACG ATGAGTTTTA CCGTTCAATC
GGCATCGACG AACGTATTTT TACTGAGGAT TTTCATCCAC CCTTCGATGT TGGTCAATTC
GAAGCGACAA ATTCCGATGC ATTGTTTCTG GACGCAGCAT TCGGTGCTCA TATGGAAAGT
ATGAGTGGCG ATGCCGCGCC TCGTTTTCTA TATGCACTAA CCAACTTCAA TCATGGCCCT
CATAACCGAA GGCTGGTCGC GCCTGGACGT TTTGAGAGAG AGCGCGCCTT CGCTGCCGCA
AGCCTCCCCG ATGCCTACTA TGCTGAATAC TACGCCCGCC TCGTAGAGAC CGCTGTCACC
TGGAATCGAC TCAAGTCCGA GCTTTCAACC CGTTTTCCGA GCCGTCCGGT ACTGATTGTA
CACTACGGAG ATCATCAACC CGTAATGACA CGGCGAATCG AGGCAAAACT GAAGCTCCCC
ATAGACCCGC GGCGCCAGTT CCGTACGTTC TATGCCATAG AAACTCTAAA TGACTGTTCT
GATCGACTTA TCTCTGGGCG GGGCCAAGAC TTGGATATTG CTTTTCTTGG GACCGTCGCT
TTGCAGCAGG CGGGCCTGCC ACTAGATGAG ATCTTTGCCA CGCGCGCGAG CCTTATTGAA
CATTGCGGTG ACGCTTACTT CATATCGTCT TCGGAACGGA AACGCCGTTT CCACCGCACA
CTTGTAGATC TCGGCATGAT CGACGTAGCG TCAGCCAGCG ACGTCCTTCA ACATCGACCA
GTTGCTTGA
 
Protein sequence
MKLTLIIVAA VTVVWLAERS VGHLKLAIAA LCFIAAALLF VVTSLERAIL LSSIVVAAIF 
GASSVKYNHS GLKLTVTDLP LAFAGTVPFF IVQYPLAVMA VLAGGIGLIL ATVATLLYVP
GLPVSLELQS ILFFFAFIGL IAAYRADGGA ESLHRIAAQR RCFFSTFIGS LLNPLSWRQF
SGLVLSDIAE DSLQLMPAIP ARTLDYPDII VIQHESIFDP RVFGLPVEPI VEAFLSPKNG
LFGSLNVDIF GGGSWQSEFS LLTGLSSASF GSNAYFLFKR GVGRFHNSLP NALTAIGYRT
MLASSCRRSF LHYDEFYRSI GIDERIFTED FHPPFDVGQF EATNSDALFL DAAFGAHMES
MSGDAAPRFL YALTNFNHGP HNRRLVAPGR FERERAFAAA SLPDAYYAEY YARLVETAVT
WNRLKSELST RFPSRPVLIV HYGDHQPVMT RRIEAKLKLP IDPRRQFRTF YAIETLNDCS
DRLISGRGQD LDIAFLGTVA LQQAGLPLDE IFATRASLIE HCGDAYFISS SERKRRFHRT
LVDLGMIDVA SASDVLQHRP VA