Gene Information |
Locus tag | Nmul_A0239 |
Symbol | |
ID | 3785725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 255239 |
End bp | 256657 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637810314 |
Product | sugar transferase |
Protein accession | YP_410939 |
Protein GI | 82701373 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2148] Sugar transferases involved in lipopolysaccharide synthesis |
TIGRFAM ID | [TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase; [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGAGG CAAATCCTAC ATTTTCCGCT TCAACCGTTA CCGCCCGCCA GGTGAAAGGC CCGCTGGCTA ATAATATGCT GAGCGCGGTC GAGTCGGTAC TTGATCCGGC AGTACTGACA CTTTCGTTAT GGCTGATAAG CACCAGTTTT GAAGGCGAAT TGCTCCCGCC TTATTTGATC CTGTCGGTGA TCGTATTCTC TATCTCCTTT CCAGGTGCGT CACATCTCCG GTCATCCGTC AAGGTTCTCA TCTTCGACGT GCTCTACACC TGGTTCTGGA TGGCTTTGCT ACTGTTTTTT CTCGGTTTTG CGACCGGCTA TATCGCCCAA TTCTCCAGCC AGGTGCTCAT TACATGGCTA TGGGCTGCCC CCCTAAGTCA AATCGGCGTC CATCTGCTGT TTCGTATCGC CGCACCCTAT CTTCTCATGC TCCAGGGTCC GCCACAGCGC GCCATCATCG TCGGCATGAA CGAGCAGGGG GTCGCCCTCG CCCGTCGTAT TCATGAGACG CGTTATTCGA ATATGGAACT GTCCGGTTTT TTTGATGATC GGAATGAGAG CCGACTTTCG CATGCGCGGA ATAATCCCTT GCTCGGAAGG CTTCGAGAGC TTCCCGAATT TGTCAAGGAG CATCGCATCC AGTTCATTTA TCTGTCGTTG CCGATGGCTT CCCAGCCACG CATCCTGCAT GTTCTCGAAG AACTGAAAGA TACGACGGCC TCCATCTATT TCGTACCCGA CATGTTTATT ACGGATCTTA TTCAGGCGCG AAGTGGCACA GTGTGTGGTA CGCCGGTTAT CGCTGTCTGC GAATCGCCTT TTACGGGCTC CAATGGGATG GTCAAGCGCG CAAGCGACAT TATCCTCTCC CTGCTTATAC TGCTACTCAT ATCACCACTT CTGCTGCTCA TCGCGGTTGC CATCCGGCTG GATTCGCCAG GGCCCATTAT TTTCAAGCAG CGACGCTATG GGCTGGACGG AGAGGAAATC CTTGTTTACA AGTTTAGATC GATGAGCGTC TGCGAAGATG GGAATACCAT CCGGCAAGCA CAAAGGAATG ATAACCGCAT AACCCGCGTC GGGGCTTTCC TGCGAAGCAA TTCCCTGGAT GAATTGCCGC AGTTCATCAA TGTATTGCAG GGGCGCATGA GCATCGTGGG CCCCAGGCCT CATGCAGTAG CGCACAACGA GATATATCGC AACCTTATAA AAGGCTACAT GATCCGGCAC AAGGTAAAGC CGGGAATTAC AGGTTGGGCG CAGGTGAACG GCTACCGTGG GGAGACGCGG ACCCTGGACA AGATGCAGGC GCGTATCGAT CATGATCTCG ATTATTTACG CAACTGGTCG TTGCGGCTCG ATTTGCACAT CATCTGCAAG ACTATCCTGG TAGTACTGAA AGATCGGGCT GCATATTAA
|
Protein sequence | MAEANPTFSA STVTARQVKG PLANNMLSAV ESVLDPAVLT LSLWLISTSF EGELLPPYLI LSVIVFSISF PGASHLRSSV KVLIFDVLYT WFWMALLLFF LGFATGYIAQ FSSQVLITWL WAAPLSQIGV HLLFRIAAPY LLMLQGPPQR AIIVGMNEQG VALARRIHET RYSNMELSGF FDDRNESRLS HARNNPLLGR LRELPEFVKE HRIQFIYLSL PMASQPRILH VLEELKDTTA SIYFVPDMFI TDLIQARSGT VCGTPVIAVC ESPFTGSNGM VKRASDIILS LLILLLISPL LLLIAVAIRL DSPGPIIFKQ RRYGLDGEEI LVYKFRSMSV CEDGNTIRQA QRNDNRITRV GAFLRSNSLD ELPQFINVLQ GRMSIVGPRP HAVAHNEIYR NLIKGYMIRH KVKPGITGWA QVNGYRGETR TLDKMQARID HDLDYLRNWS LRLDLHIICK TILVVLKDRA AY
|
| |
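The numeric fields in the record above are related by simple arithmetic, and the sketch below shows one way to cross-check them. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The helper names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table in this record.
START_BP = 255239
END_BP = 256657
GENE_LENGTH_BP = 1419
PROTEIN_LENGTH_AA = 472


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """Percent G+C, ignoring the spaces that separate the 10-base blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_ok, protein_ok)  # prints: True True
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.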