Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1791 |
Symbol | |
ID | 3784369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2042221 |
End bp | 2043447 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811877 |
Product | glycosyl transferase family protein |
Protein accession | YP_412480 |
Protein GI | 82702914 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | [TIGR02195] lipopolysaccharide heptosyltransferase II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.306314 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGCCG GAGCCTGGAA AAATTACAGG AACATATTAT GCGTTCGCCC GGACAACATG GGCGATGTGC TGATGAGCCA GCCGGCGATG CGGGCGCTCA AGGAATCGGT TCCGGGCAGA AAAATCACTC TGTTGACTTC CACCGCGGGA GCATGCATTG CTCCCTTCAT TCCGGAAGTG GACGAAACGA TTCCGTTCGA TGTTCCCTGG GTAAAGACGA CCGAAACCAA TGGAGCGCAA CGGTTGCTGG CGCTTGCCGG CGAATTGCTA GCCCTTCAGC TTGATGCCGC AGTAATCTTT ACAGTCTATA GCCAGAACCC GTTGCCGGCT GGCATGTTGT GCTATCTGGC GGGTATCAAA GCCGTTCTGG GCTATTGCCG CGAGAATCCT TACCAGCTTA TCAATCAGTG GGTGCCGGAC AGGGAACCGC TGGACTACGT CGTCCACGAG GTCGAACGGC AGCTTCGCCT GGTCGAGATG GCCGGCGCAA AAACCTCCGA TACCCGGCTC CTGCTGAAAA TTCCAGAGGA AACGCGGAAA GAAGCAACGG ACAGGGTATG GGAGATATTA AATGTAGCCG GCATCCGGGC AGGTATGGGC TGGCTGGTGC TGCATGCGGG CGTCAGTGAA GAAAAGCGTC TTTATCCGGC CAGAGATTAT ATAACGGCCT GCCAATCGCT CATCAGGCAA GGCTATAAGA TCCTGCTCAC AGGCAGCGGC AGCGAGCGCG ATTATGTAGA CCAGATCGCT CGCCAGCTCA GTGATGCCGC GATAAATGTC GCAGGCGAGC TATCCATTGC AGAACTGATT GTCCTGGTCG AAGCGGCACC CGTCTTGATC TCGAATAATA CCGGTCCTGT CCATATTGCC ACCGCCGTCG GCACCCCCGT GGTGGTACTG TACGCGATGA CAAATCCCCA GCACACGCCA TGGCAAGTGC CAAGCAAAGT GCTCTACTTC GAGGTGCTGC CGGAACTGAG GACAAAGAAC CAGCTGCTTC AGTCCTTCCC CGGATCCTCC ACACCCAGGG CTTCACCGGA GGCAATAGTG GCGGCTGTAG GTGAGCTTGT TTCACCACGG AGAAAACACG GATCCTTCAT TCCAAGCCCC CTCCGGAATC CAGTCCCCAC CTGCAATAGC CTTCCTGACA CTGCACTGGC TGCCTGTTTC ACTCCGCATA ATGAATATGG CATGTGCCCG CCTATTCGAT ATGGCGAGGA TGTATGA
|
Protein sequence | MFAGAWKNYR NILCVRPDNM GDVLMSQPAM RALKESVPGR KITLLTSTAG ACIAPFIPEV DETIPFDVPW VKTTETNGAQ RLLALAGELL ALQLDAAVIF TVYSQNPLPA GMLCYLAGIK AVLGYCRENP YQLINQWVPD REPLDYVVHE VERQLRLVEM AGAKTSDTRL LLKIPEETRK EATDRVWEIL NVAGIRAGMG WLVLHAGVSE EKRLYPARDY ITACQSLIRQ GYKILLTGSG SERDYVDQIA RQLSDAAINV AGELSIAELI VLVEAAPVLI SNNTGPVHIA TAVGTPVVVL YAMTNPQHTP WQVPSKVLYF EVLPELRTKN QLLQSFPGSS TPRASPEAIV AAVGELVSPR RKHGSFIPSP LRNPVPTCNS LPDTALAACF TPHNEYGMCP PIRYGEDV
|
| |