Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1732 |
Symbol | |
ID | 3786209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1979521 |
End bp | 1981101 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637811818 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_412421 |
Protein GI | 82702855 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.532923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTTA TCCAGGCAGG GGCGGAGAAG CCCTGGACGG TACGGGCAGC AGGATCGGTT CTGGAACCGG TGGCGGATAT TGCTCTGCTG CTGGAAGGCA CCTATCCTTA TGTCCGTGGC GGGGTATCTT CATGGGTCCA CCAGATCATA AACGGTTTGC CGGAATTTAC CTTCGCCGTG ATCTTTATAG GCGGCAATAA GGAAATGTAT GGACCGGCGC AATATCAGTT TCCGCCGAAT GTGACGCATG TCGAAACGCA CTATCTGTTG CACCGCAGCC ATGGGAAGCC GCATGCCCGC AAGGGCAGCC GCCAGCCTTT CCAGGAAATG GAAGAACTGC ACGCCATCAT GCGGCAATCC GGGAAATTGG CTGACAGGAA AATGATTGAT GCGTTTGCGC GATTGGGCTC CAAGGGTGGC ATCAGTCAGG AGGATTTTCT GTATAGCGAA GCTTCCTGGG ATTACATTAC CCGACAGTAT AAAGAGCGCT GTACTGAACC CTCGTTTGTG GACTATTTCT GGTCGATACG TGCAATGCAT GCACCTTTGT TCGTGCTGGC CGATATTGCG ACCGGGCTTC CACCGGTTCG GGCAGTGCAT GCCATTTCGA CTGGTTATGC AGGCCTGCTG GGCGCGATGA TACGGCTGCG GCGAAACATT CCTTTCGTGC TCACCGAGCA CGGAATCTAT ACCAAGGAGC GGAAAATCGA CTTGGCGCAG GCTACCTGGA TCCATGATCA CAATGACGAT GTATGCAATA CGCTGCATGA GGAAATGGGA TATATCCGGG GCCTCTGGAT CAGATTCTAC GAGCAGCTGG GGCGCATGGC GTACGCGCAG GCTTCGCCCA TTATCAGCCT GTATGAAGGG AATCGGTTGA GACAGATTGC CGACGGAGCA GTCCCGGAAA AAACACGTAT CATTACCAAC GGCCTCAATG TGGAACGCTA TCGCGACGCA CTCGAAAAAA GACCGGAAAA GATTCCACCG GTGCTGGGAC TGGTCGGGCG CGTGGTGCCG ATCAAGGATA TCAAAACGTT CATCCGGACA TTGCGGATGC TGGTCAAGGA GCGTCCTGAC GCGCAAGGAT GGGTTGTCGG CCCGGAGGAC GAAGATCCGT TGTATGTGAA TGAATGCAAG GAACTGGCCG AAAGTCTCGG CCTGGGGAAT CATCTCAGAT TCATGGGCTT TCAAAATATG CTGGAAATTC TGCCCCAGCT TGGCCTGATG GTACTGACCT CGATCAGTGA AGCTCTGCCG CTGGTAATAC TCGAGGCTTT CGCCAGCGGT GTTCCCTGCC TTGCGACGGA TGTAGGGTCC TGCCGGGAGC TCATCGAAGG AGCGGCCGAG CAGGACCGGG CATTGGGGGC GGCGGGTAGT GTGGTGCACA TTGCCGATCC AGAAGGCACG GCAAAAGCAG CCCTCGAACT GCTCAACAAC CCGGAGAAGT GGAGTGCTGC GCAGCAGGCG GGGCTTGCAC GCGTGAAACG CTATTATAAC GACCAACTCA TGTTTTCATC TTATCGGGAA GTCTACTCGG AAGCCTTGAG TGTACCGCGG CATCCGGCAG TAGCGAACTG A
|
Protein sequence | MSVIQAGAEK PWTVRAAGSV LEPVADIALL LEGTYPYVRG GVSSWVHQII NGLPEFTFAV IFIGGNKEMY GPAQYQFPPN VTHVETHYLL HRSHGKPHAR KGSRQPFQEM EELHAIMRQS GKLADRKMID AFARLGSKGG ISQEDFLYSE ASWDYITRQY KERCTEPSFV DYFWSIRAMH APLFVLADIA TGLPPVRAVH AISTGYAGLL GAMIRLRRNI PFVLTEHGIY TKERKIDLAQ ATWIHDHNDD VCNTLHEEMG YIRGLWIRFY EQLGRMAYAQ ASPIISLYEG NRLRQIADGA VPEKTRIITN GLNVERYRDA LEKRPEKIPP VLGLVGRVVP IKDIKTFIRT LRMLVKERPD AQGWVVGPED EDPLYVNECK ELAESLGLGN HLRFMGFQNM LEILPQLGLM VLTSISEALP LVILEAFASG VPCLATDVGS CRELIEGAAE QDRALGAAGS VVHIADPEGT AKAALELLNN PEKWSAAQQA GLARVKRYYN DQLMFSSYRE VYSEALSVPR HPAVAN
|
| |