Gene Information |
Locus tag | Nmul_A1783 |
Symbol | |
ID | 3784361 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2036280 |
End bp | 2037287 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637811869 |
Product | glycosyl transferase family protein |
Protein accession | YP_412472 |
Protein GI | 82702906 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0859] ADP-heptose:LPS heptosyltransferase |
TIGRFAM ID | |
|
|
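The coordinate, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; both assumptions are consistent with the listed values, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2036280
end_bp = 2037287

# Assumption: coordinates are inclusive on both ends, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final codon is assumed to be a stop
# codon, which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1008
print(remainder)       # 0
print(protein_length)  # 335
```

Under these assumptions the computed values agree with the Gene Length (1008 bp) and Protein Length (335 aa) fields.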
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAGA TGGGTAATCT GCCAATCAAG AAGATCGCGA TTTTCAGGGC CTTGAAGCTG GGGGATTTTC TGTTGTTCGT CCCTGCGCTG CGGGCGATTA GGCGGGCTTT TCCGCAAGCC ACCATCGATT ATGTCGGGTT GCCCTGGAAC AAGGCGCTCG CCGCGCGTTA CAATCATTAT ATCGACGAAT TTATCGAATT TCCGGGTTTT CCCGGATTGC CCGAGCATCC CTTCAGGGCT GAAGCAGTCA CGGCTTTTCT GGACGGCATG CAGCGGCGAC AATACGACCT TGCCCTGCAA ATGCATGGCA AAGGTACAGT ATCTAATCTT GTCGTCTCTC TTTTCGGGGC GGCTATTGCG GCCGGGTTTG CGAGCGAAGG CAACTCTCAC TGGCCTAACC GCGATTTCTT CATGCCATAC CCCTCCAGGC AGCCTGAGTT GCTAAGGAAT CTGGCGTTAC TCGAGTTTCT CGGCATGGAG CAGGCTGATC GCGCGGCGGA CAGAACCATG GAATTTCCGT TATTGGACAT GGACTGCCAG AAACTTCGGG AACTGCAAGA ATATGGGACT ATTCGTGATA AACCTTATGT CTGCCTGCAT CCCGGCGCGA TTTCCGCAAC CCCCTGGCCT GCTGCTCATT TCGCGGAGGT GGCGGACAGG TGTATTCGGC AGGGTTTGAA AGTGGTGTTG ACGGGTACTG CGGAAGAGAA GCCGCTCACG CAAGCGGTCG CCGGGAAAAT GACGGGTACG GCGATTGATC TTGCCGGTAA AACCGCCATC GGGGCACTCG CCGCCCTTCT GAAGGGCAGC CGGGCGGTGA TCTCGAATGA CACGGGAGTT GCGCATTTGG CGGTAGCGGT CGATGCCCCG AGTGTCACCG TCTTTACCAC GACCGATCCG CTGATTTGGG GTCCGTTGGA TCAGGTTCAT CATCGGGTTG TTTCGGGAAA CGACGTGAAG ACGCCGGAAA TGGCAATACG GGCGCTGGAG GAATTAATTG GGCGTTAA
|
Protein sequence | MAQMGNLPIK KIAIFRALKL GDFLLFVPAL RAIRRAFPQA TIDYVGLPWN KALAARYNHY IDEFIEFPGF PGLPEHPFRA EAVTAFLDGM QRRQYDLALQ MHGKGTVSNL VVSLFGAAIA AGFASEGNSH WPNRDFFMPY PSRQPELLRN LALLEFLGME QADRAADRTM EFPLLDMDCQ KLRELQEYGT IRDKPYVCLH PGAISATPWP AAHFAEVADR CIRQGLKVVL TGTAEEKPLT QAVAGKMTGT AIDLAGKTAI GALAALLKGS RAVISNDTGV AHLAVAVDAP SVTVFTTTDP LIWGPLDQVH HRVVSGNDVK TPEMAIRALE ELIGR
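The gene and protein sequences can be checked against each other by translating the nucleotide sequence, and the GC content field can be recomputed the same way. The following is a minimal sketch using only the Python standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is written out by hand on the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, which this sketch does not model). Although the gene lies on the minus strand, the sequence as listed begins with ATG, so it is treated here as the coding strand and is not reverse-complemented.

```python
from itertools import product

# Codon table built in T, C, A, G order for each codon position.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGGCTCAGA TGGGTAATCT GCCAATCAAG"
print(translate(prefix))  # MAQMGNLPIK
```

Passing the full gene sequence to `translate` and `gc_percent` in place of `prefix` allows a comparison with the Protein sequence and GC content fields of the record.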