Gene Nmul_A0289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0289 
Symbol 
ID3785535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp309931 
End bp311148 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content55% 
IMG OID637810365 
Productglycosyl transferase, group 1 
Protein accessionYP_410989 
Protein GI82701423 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAATTC TGCATATCCT TGATCACTCC ATTCCCCTGC ACAGCGGCTA CACTTTCCGA 
ACGCTTTCGA TTCTGAATGA ACAACGGAAC CTGGGGTGGG AAACCTTTCA TCTGACCGGA
TCGAAGCAGG AAAATTGCAG TGTGCTGGAA GAGTGCGTGG AAGGATGGCA TTTTTATCGT
ACCCCAGCCC CGTCAGGACT GAGGGCACGG CTGCCCGTCT TGAATCAGCT GGCTGTCATG
GAGGCTCTTA CCCATCGCCT GACTGAGGTG GTCAAGATTG TCGAACCGGA TATCCTGCAC
GCGCATTCTC CAGTCCTGAA CGCTTTGCCT GCCTTGCGCG TGGGGCGGAG ATCGGGTATT
CCTGTCGTCT ACGAAGTCCG GGCATTCTGG GAAGATGCGG CCGTAGATCA CGGCACTCAT
CGCGAATGGG GCGCACGATA TCGTCTCACC CGTGAGCTGG AGAGTTACGC GTTAAGGCAT
GTTGATGCAG TAACCACGAT TTGTGAGGGG CTGCGCGGCG ACATCCTCAA GCGGGGTATT
CCGTCAGAAA AGGTAACTGT CATTCCCAAT GCGGTCAATC TCGAAACTTT CAGGATGAGC
GAGCGCGGGG ATTTGCAACT TGCAAACGCA CTCGGGATGG AGGGCAAGGT GTTGCTCGGC
TTCATCGGCT CGTTTTACGC GTATGAAGGA TTGACTGTAC TGCTCAACGC ACTGCCCCGT
ATGTTAGCGG CAAATCCCGA CATCCGCATT CTTCTGGTGG GAGGAGGGCC TCAGGAAGAC
GAATTAAAAT CTCTTACAGC CCGGAGGGGT CTGCAAGGCA AGGTTATTTT CACGGGCCGC
GTTCCCCATG ATCAGGTTCG GCGCTATTAC AATCTGATCG ATATTCTCGT TTATCCGAGA
TTGCCCATGC GCCTTACAGA CCTGGTCACG CCCCTCAAGC CGCTGGAGGC CATGGCGCAA
GGAAGGCTCG TTGCCGCTTC AGATGTAGGC GGACATCTCG AGCTCATCCA GGATGGAAAA
ACCGGAGTGC TTTTCAAGGC TGGCGATTCC GACGCCTTGG CAGCCCGGAT ATTGAATCTC
ATATCCAGCA CCGATACCTG GGATACCCTT CGGGCCGGGG CGCGCGATTT TGTCGAAACC
CAGCGCAACT GGGCTGGCAG CGTGGCCGGC TATAAGGAGA TCTATCGCAC TCTCCTTTCA
AGGAAAGCAT CGTCATGA
 
Protein sequence
MRILHILDHS IPLHSGYTFR TLSILNEQRN LGWETFHLTG SKQENCSVLE ECVEGWHFYR 
TPAPSGLRAR LPVLNQLAVM EALTHRLTEV VKIVEPDILH AHSPVLNALP ALRVGRRSGI
PVVYEVRAFW EDAAVDHGTH REWGARYRLT RELESYALRH VDAVTTICEG LRGDILKRGI
PSEKVTVIPN AVNLETFRMS ERGDLQLANA LGMEGKVLLG FIGSFYAYEG LTVLLNALPR
MLAANPDIRI LLVGGGPQED ELKSLTARRG LQGKVIFTGR VPHDQVRRYY NLIDILVYPR
LPMRLTDLVT PLKPLEAMAQ GRLVAASDVG GHLELIQDGK TGVLFKAGDS DALAARILNL
ISSTDTWDTL RAGARDFVET QRNWAGSVAG YKEIYRTLLS RKASS