Gene Nmul_A0239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0239 
Symbol 
ID3785725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp255239 
End bp256657 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content53% 
IMG OID637810314 
Productsugar transferase 
Protein accessionYP_410939 
Protein GI82701373 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGAGG CAAATCCTAC ATTTTCCGCT TCAACCGTTA CCGCCCGCCA GGTGAAAGGC 
CCGCTGGCTA ATAATATGCT GAGCGCGGTC GAGTCGGTAC TTGATCCGGC AGTACTGACA
CTTTCGTTAT GGCTGATAAG CACCAGTTTT GAAGGCGAAT TGCTCCCGCC TTATTTGATC
CTGTCGGTGA TCGTATTCTC TATCTCCTTT CCAGGTGCGT CACATCTCCG GTCATCCGTC
AAGGTTCTCA TCTTCGACGT GCTCTACACC TGGTTCTGGA TGGCTTTGCT ACTGTTTTTT
CTCGGTTTTG CGACCGGCTA TATCGCCCAA TTCTCCAGCC AGGTGCTCAT TACATGGCTA
TGGGCTGCCC CCCTAAGTCA AATCGGCGTC CATCTGCTGT TTCGTATCGC CGCACCCTAT
CTTCTCATGC TCCAGGGTCC GCCACAGCGC GCCATCATCG TCGGCATGAA CGAGCAGGGG
GTCGCCCTCG CCCGTCGTAT TCATGAGACG CGTTATTCGA ATATGGAACT GTCCGGTTTT
TTTGATGATC GGAATGAGAG CCGACTTTCG CATGCGCGGA ATAATCCCTT GCTCGGAAGG
CTTCGAGAGC TTCCCGAATT TGTCAAGGAG CATCGCATCC AGTTCATTTA TCTGTCGTTG
CCGATGGCTT CCCAGCCACG CATCCTGCAT GTTCTCGAAG AACTGAAAGA TACGACGGCC
TCCATCTATT TCGTACCCGA CATGTTTATT ACGGATCTTA TTCAGGCGCG AAGTGGCACA
GTGTGTGGTA CGCCGGTTAT CGCTGTCTGC GAATCGCCTT TTACGGGCTC CAATGGGATG
GTCAAGCGCG CAAGCGACAT TATCCTCTCC CTGCTTATAC TGCTACTCAT ATCACCACTT
CTGCTGCTCA TCGCGGTTGC CATCCGGCTG GATTCGCCAG GGCCCATTAT TTTCAAGCAG
CGACGCTATG GGCTGGACGG AGAGGAAATC CTTGTTTACA AGTTTAGATC GATGAGCGTC
TGCGAAGATG GGAATACCAT CCGGCAAGCA CAAAGGAATG ATAACCGCAT AACCCGCGTC
GGGGCTTTCC TGCGAAGCAA TTCCCTGGAT GAATTGCCGC AGTTCATCAA TGTATTGCAG
GGGCGCATGA GCATCGTGGG CCCCAGGCCT CATGCAGTAG CGCACAACGA GATATATCGC
AACCTTATAA AAGGCTACAT GATCCGGCAC AAGGTAAAGC CGGGAATTAC AGGTTGGGCG
CAGGTGAACG GCTACCGTGG GGAGACGCGG ACCCTGGACA AGATGCAGGC GCGTATCGAT
CATGATCTCG ATTATTTACG CAACTGGTCG TTGCGGCTCG ATTTGCACAT CATCTGCAAG
ACTATCCTGG TAGTACTGAA AGATCGGGCT GCATATTAA
 
Protein sequence
MAEANPTFSA STVTARQVKG PLANNMLSAV ESVLDPAVLT LSLWLISTSF EGELLPPYLI 
LSVIVFSISF PGASHLRSSV KVLIFDVLYT WFWMALLLFF LGFATGYIAQ FSSQVLITWL
WAAPLSQIGV HLLFRIAAPY LLMLQGPPQR AIIVGMNEQG VALARRIHET RYSNMELSGF
FDDRNESRLS HARNNPLLGR LRELPEFVKE HRIQFIYLSL PMASQPRILH VLEELKDTTA
SIYFVPDMFI TDLIQARSGT VCGTPVIAVC ESPFTGSNGM VKRASDIILS LLILLLISPL
LLLIAVAIRL DSPGPIIFKQ RRYGLDGEEI LVYKFRSMSV CEDGNTIRQA QRNDNRITRV
GAFLRSNSLD ELPQFINVLQ GRMSIVGPRP HAVAHNEIYR NLIKGYMIRH KVKPGITGWA
QVNGYRGETR TLDKMQARID HDLDYLRNWS LRLDLHIICK TILVVLKDRA AY