Gene Nmul_A1791 details


Gene Information

Locus tag: Nmul_A1791
Symbol:
ID: 3784369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2042221
End bp: 2043447
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 56%
IMG OID: 637811877
Product: glycosyl transferase family protein
Protein accession: YP_412480
Protein GI: 82702914
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02195] lipopolysaccharide heptosyltransferase II
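
The start, end, gene length and protein length listed above are consistent with each other, which can be checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the terminal stop codon is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record above.
start_bp = 2042221
end_bp = 2043447

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon that is not counted.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1227 408
```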


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.306314
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGCCG GAGCCTGGAA AAATTACAGG AACATATTAT GCGTTCGCCC GGACAACATG 
GGCGATGTGC TGATGAGCCA GCCGGCGATG CGGGCGCTCA AGGAATCGGT TCCGGGCAGA
AAAATCACTC TGTTGACTTC CACCGCGGGA GCATGCATTG CTCCCTTCAT TCCGGAAGTG
GACGAAACGA TTCCGTTCGA TGTTCCCTGG GTAAAGACGA CCGAAACCAA TGGAGCGCAA
CGGTTGCTGG CGCTTGCCGG CGAATTGCTA GCCCTTCAGC TTGATGCCGC AGTAATCTTT
ACAGTCTATA GCCAGAACCC GTTGCCGGCT GGCATGTTGT GCTATCTGGC GGGTATCAAA
GCCGTTCTGG GCTATTGCCG CGAGAATCCT TACCAGCTTA TCAATCAGTG GGTGCCGGAC
AGGGAACCGC TGGACTACGT CGTCCACGAG GTCGAACGGC AGCTTCGCCT GGTCGAGATG
GCCGGCGCAA AAACCTCCGA TACCCGGCTC CTGCTGAAAA TTCCAGAGGA AACGCGGAAA
GAAGCAACGG ACAGGGTATG GGAGATATTA AATGTAGCCG GCATCCGGGC AGGTATGGGC
TGGCTGGTGC TGCATGCGGG CGTCAGTGAA GAAAAGCGTC TTTATCCGGC CAGAGATTAT
ATAACGGCCT GCCAATCGCT CATCAGGCAA GGCTATAAGA TCCTGCTCAC AGGCAGCGGC
AGCGAGCGCG ATTATGTAGA CCAGATCGCT CGCCAGCTCA GTGATGCCGC GATAAATGTC
GCAGGCGAGC TATCCATTGC AGAACTGATT GTCCTGGTCG AAGCGGCACC CGTCTTGATC
TCGAATAATA CCGGTCCTGT CCATATTGCC ACCGCCGTCG GCACCCCCGT GGTGGTACTG
TACGCGATGA CAAATCCCCA GCACACGCCA TGGCAAGTGC CAAGCAAAGT GCTCTACTTC
GAGGTGCTGC CGGAACTGAG GACAAAGAAC CAGCTGCTTC AGTCCTTCCC CGGATCCTCC
ACACCCAGGG CTTCACCGGA GGCAATAGTG GCGGCTGTAG GTGAGCTTGT TTCACCACGG
AGAAAACACG GATCCTTCAT TCCAAGCCCC CTCCGGAATC CAGTCCCCAC CTGCAATAGC
CTTCCTGACA CTGCACTGGC TGCCTGTTTC ACTCCGCATA ATGAATATGG CATGTGCCCG
CCTATTCGAT ATGGCGAGGA TGTATGA
 
Protein sequence
MFAGAWKNYR NILCVRPDNM GDVLMSQPAM RALKESVPGR KITLLTSTAG ACIAPFIPEV 
DETIPFDVPW VKTTETNGAQ RLLALAGELL ALQLDAAVIF TVYSQNPLPA GMLCYLAGIK
AVLGYCRENP YQLINQWVPD REPLDYVVHE VERQLRLVEM AGAKTSDTRL LLKIPEETRK
EATDRVWEIL NVAGIRAGMG WLVLHAGVSE EKRLYPARDY ITACQSLIRQ GYKILLTGSG
SERDYVDQIA RQLSDAAINV AGELSIAELI VLVEAAPVLI SNNTGPVHIA TAVGTPVVVL
YAMTNPQHTP WQVPSKVLYF EVLPELRTKN QLLQSFPGSS TPRASPEAIV AAVGELVSPR
RKHGSFIPSP LRNPVPTCNS LPDTALAACF TPHNEYGMCP PIRYGEDV
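
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch, the helpers below strip that layout, compute GC content, and translate a nucleotide string codon by codon. The function names (`clean`, `gc_percent`, `translate`) are hypothetical. The amino-acid assignments used are those shared by the standard code and translation table 11; the alternative start codons that table 11 permits are not handled, which is assumed not to matter here because the gene begins with ATG.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTTTGCCG GAGCCTGGAA AAATTACAGG"))  # prints: MFAGAWKNYR
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content listed in the record.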