Gene Nmul_A0301 details

Gene Information

Locus tag: Nmul_A0301
Symbol:
ID: 3785334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 325227
End bp: 326288
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 55%
IMG OID: 637810377
Product: polysaccharide deacetylase
Protein accession: YP_411001
Protein GI: 82701435
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
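
The start, end, gene length, and protein length values above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon. Both are assumptions, not statements from the record, and the function names are hypothetical.

```python
# Coordinates copied from the record above.
START_BP = 325227
END_BP = 326288


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1062
print(protein_length(length_bp))  # 353
```

Under these assumptions the computed values agree with the listed gene length (1062 bp) and protein length (353 aa).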


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.351436
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCCTTGTT TGCTTGCTGC TCGTGTCCGC TTTTCTTCCA TGAGAACTTC TGAATATGAT 
CACGTGGACA ATCTGGTAAA TCAGGTGCCA CTTTGGCCGC TGTTGACGAG CCGTGCGATG
TTGCATCTGC TTTCCCCCGG GGGGCGGCGC GCCCGCCTGT CCATACTGAT TTATCATCGG
GTGCTGCCCT GCCAGGATCC ACTTTTTCCG GAGGAGAGTC ATGCCCGGTC CTTCGATCAG
CAGATGGAGC AGCTTGCGGC CTGTTTCAGG GTCATATCTC TCGGGGAGGC GATTCGCGGC
CTTCGTAATG GAACGCTGCC CCCACGTGCT GCCTGTGTCA CTTTCGATGA CGGTTATGCC
GATAATGCGG AAATCGCTTT GCCGATTCTG AAAAAACGCG GTATTCCTGC TACTTTTTTC
GTGGCAACCG GTTTTCTCGA CGGAGGCCGG ATGTTCAACG ATACCGTGAT AGAATTGATA
CGGGGTGCGC CGGGAAGCAC CGTGGATCTT GATAGCCTGG GCTTGGGAAG GTTTCCGATC
GGAACTGTTT CCGAGCGTCG TCAAACTATT CACCAGTTAT TGGGCAAGCT CAAGTATCTG
CCCTCCGCAT TACGGCAATC GACCGTGGAG GCAATGTCGG CGTCAATTCC AGTCATGCTG
CCTGACAATC TCATGATGAC CTCCGAGCAG GTCAGAATGA TGCACAATGC AGGAATGGAA
ATAGGTGGGC ATACCGCCAG TCATCCCATC CTGGCAAAAA TGGAGAGCAG AGCAGCTTGT
GCTGACATTG CCACCGGCAA AGAGATGCTG GAGGCCATTA TTCGCGCTCC GGTGCGGTTT
TTTGCCTACC CGAATGGGAA GCCGGGGAGA GACTATTTGC CCGATCACGT CCGGATGGTA
AAGAAGCTGG GGTTTGACGC GGCAGTCTCG ACTGCCCATG GAGCAGCAAG AAAAGGAAGC
GACCTCCATC AGCTCCCTCG GTTTACTCCC TGGGATAGGC GCCCGCTGCG ATTCGCCCTT
CTGATGGCGC GAAATATGCT GAAAACTGGA GAAACAGTTT AA
 
Protein sequence
MPCLLAARVR FSSMRTSEYD HVDNLVNQVP LWPLLTSRAM LHLLSPGGRR ARLSILIYHR 
VLPCQDPLFP EESHARSFDQ QMEQLAACFR VISLGEAIRG LRNGTLPPRA ACVTFDDGYA
DNAEIALPIL KKRGIPATFF VATGFLDGGR MFNDTVIELI RGAPGSTVDL DSLGLGRFPI
GTVSERRQTI HQLLGKLKYL PSALRQSTVE AMSASIPVML PDNLMMTSEQ VRMMHNAGME
IGGHTASHPI LAKMESRAAC ADIATGKEML EAIIRAPVRF FAYPNGKPGR DYLPDHVRMV
KKLGFDAAVS TAHGAARKGS DLHQLPRFTP WDRRPLRFAL LMARNMLKTG ETV
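
The gene sequence begins with TTG, yet the protein sequence begins with M. This is consistent with translation table 11, which lists TTG among its alternative initiation codons; an initiator codon is read as methionine. The sketch below is a minimal, dependency-free illustration of that rule, not the pipeline used to produce this record. The names `translate_cds`, `CODON_TABLE`, and `START_CODONS` are hypothetical, and only short fragments of the sequence above are used.

```python
# Codon table in TCAG order; the amino acid assignments of table 11
# match the standard code, only the set of start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; spaces and line breaks in the input are ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":     # stop codon ends translation
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 nucleotides of the gene sequence above.
print(translate_cds("TTGCCTTGTT TGCTTGCTGC TCGTGTCCGC"))  # MPCLLAARVR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` would be the natural way to compare against all 353 residues.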