Gene Information |
Locus tag | Nmul_A2520 |
Symbol | |
ID | 3786645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2884056 |
End bp | 2885261 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637812611 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_413201 |
Protein GI | 82703635 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03088] sugar transferase, PEP-CTERM/EpsH1 system associated |
| |
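The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, and it assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 2884056
END_BP = 2885261
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

The strand field does not affect either check, because the interval length is the same on both strands.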
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.303259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCCAA CCGTTAGTGA CAGGCTGCGG TTGAGTCCCG CGGAACACGT CAAGCAGCCG CCTCTCATTG CTCATGTGAT TTATCATCTG GGGGTGGGGG GACTGGAAAA TGGCGTAGTC AATCTCATCA ATCATATTCC ACCCGATCGC TACCGGCATG CCATAGTTTG CCTGAAGGGG TATTCGGACT TCAGGAGCAG GATTGTCAGC GAAGACGTGG AGGTTATCGC GTTGAACAAG CGCGATGGAC ATGATTTCAA GCTATATATC GATCTTTTCA GAGCGCTCAG GCGCTTGAAG CCCGACATCG TGCATACCCG TAATCTGGCT GCCATGGAAG GTCAGGTGAT TGCAGCTCTT GCGGGGGCGC GGGCAAGAGT CCATGGCGAG CATGGGCGGG ATATGTTCGA CCTGCATGGT AAAAACCGTA AATATAATTT ATTGAGAAAA GCGATTCGTC CGTTTATAAA CCATTTCATC ACCGTCAGCA GGGATCTCGA AAGCTGGCTT GTCGATACGG TACGGGCAGC GCCGCATCGC ATCAATCAAA TCTACAATGG CGTAGACAGC CGACGCTTTT ATCCGCGTAA AAGCACATCC TTGAAGAACA ATAGGGTTCA GGGAGCGATT CCGGGATTTT TCAGGGAAGA TGCCTTTGTC ATTGGCAGTG TCGGCCGCAT GGCAGATGTG AAGAATTACC TCGGCCTGAT AGAAGCATTT TTACTTTTGC TGAAGGAAAT GCCTGCGGCT CACGAAAGAC TTCGGTTATT GATTGTCGGG GCGGGGAGTA CCCGGCAGCG CTGCATTGAA AAGGTGCGTG AAGCGGGAAT CGAAGGACTT GTCTGGTTTC CCGGTGAACG GGACGACATT CCTGAACTCA TGCGCAGCAT GGATCTGTTT GTGCTTCCTT CGCTTGGAGA GGGCATTTCC AACACCATTC TCGAGGCTAT GTCTACCGGC TTGCCCGTCG TCGCCACCCG GGTGGGAGGA AATGCGGAAC TGGTTGAGGA AGGCATGACA GGAATGCTGG TTCCGCCGGG ATCGGCAACT GCGCTGGCAG GAGCCATACA GGAGTATTAC AGAAATCCGG AGCTGTTGAT AGAACACGGC CGCGCTGCCC GAAAGCAGGT CGAGGCAAGG TTCAGTATGG AAGCCATGAT GTCCGGATAT CTTGAAGTCT ATGACCGAGT GTTACGTAGG GTATAA
|
Protein sequence | MQPTVSDRLR LSPAEHVKQP PLIAHVIYHL GVGGLENGVV NLINHIPPDR YRHAIVCLKG YSDFRSRIVS EDVEVIALNK RDGHDFKLYI DLFRALRRLK PDIVHTRNLA AMEGQVIAAL AGARARVHGE HGRDMFDLHG KNRKYNLLRK AIRPFINHFI TVSRDLESWL VDTVRAAPHR INQIYNGVDS RRFYPRKSTS LKNNRVQGAI PGFFREDAFV IGSVGRMADV KNYLGLIEAF LLLLKEMPAA HERLRLLIVG AGSTRQRCIE KVREAGIEGL VWFPGERDDI PELMRSMDLF VLPSLGEGIS NTILEAMSTG LPVVATRVGG NAELVEEGMT GMLVPPGSAT ALAGAIQEYY RNPELLIEHG RAARKQVEAR FSMEAMMSGY LEVYDRVLRR V
|
| |
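The gene sequence, protein sequence, and GC content fields can be related with a few lines of code. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG), that the spaces are only display grouping, and that GC content means the percentage of G and C bases. The helper names are hypothetical. The amino-acid assignments follow NCBI translation table 11, which matches the standard code for every codon and differs only in the set of permitted start codons, so no special start handling is needed for a gene that starts with ATG.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
prefix = "ATGCAGCCAA CCGTTAGTGA CAGGCTGCGG"
print(translate(prefix))  # MQPTVSDRLR
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the GC content field.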