Gene Information |
Locus tag | Nmul_A0300 |
Symbol | |
ID | 3785333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 323864 |
End bp | 325204 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637810376 |
Product | O-antigen polymerase |
Protein accession | YP_411000 |
Protein GI | 82701434 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | [TIGR03097] probable O-glycosylation ligase, exosortase system type 1-associated |
|
|
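The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in Protein Length. The helper names span_length and expected_protein_length are hypothetical, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 323864
end_bp = 325204
gene_length_bp = 1341
protein_length_aa = 446


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the lengths reported in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions both reported lengths agree with the coordinates: 325204 - 323864 + 1 = 1341 bp, and 1341 / 3 - 1 = 446 aa.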
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGATA TCCTTGTTAC CCTGATCGTC TTCGGATGGC TGCCCTTCGT CTTTAAAAAG CCGTACATGG GGGCGTTGAT GTGGGTGTGG ATAAGCGTCA TGAATCCGCA TACCCAGGGA TGGGGATTCG CCACCACATT TCCCTTTGCC GCGATTATTG CCGGAGTCAC GATAACCAGC CTCTTGATTA CCAAAGAACC AAAGAATCTC CCTCTAACCC CGGTTACCTG GACTTTCATA ATCTTCGTGC TCTGGATGAA TCTGAGCACC GTATTTGCCA TTTATCCCGA CGAAACGTAC ATCCAGTGGA ACAAGGTCAT GAAGATCATG CTGATGAGTT TTGTGGTCAT CATGCTGATC AGGACACAAA AGCAGATTCA ACTCCTCATC TGGACAATAG TTATCTCGCT GGGATATTAC GGCATCAAAG GGGGCGTTTT TACGCTGCTC GGAGGAGGTG TTGATCTCGT CATGGGACCC GAGAGCACCT TCATCGAGGG AAATAATGAG ATCGCCCTGG CGCTCATCAT GACCATTCCT CTCATGCATT ATGTGCAGAT GATTTCCCGG AAAACCTGGG TGAAGCATGC CATGACGGCG GCCATGCTGC TATCTGCCTT TGCCGCGCTG GGTTCTTATT CACGGGGCGC TCTGCTGGCC ATTGCGGCAA TGGGAGGATT TCTCTGGCTC AAAAGTCAAC ATAAGGGCCG CATCGCTCTA CTGGTTATTC TCGCCATTCT CCCGGCGATT GCGTATATGC CGGAGCAGTG GATGCAAAGA ATGGACAGCA TCAATACTTA TGAAGAGGAC GCGTCGGTTC AGGGCCGTTT CAATGCCTGG TGGATGGCCT ACAATCTGGC CAAGGACCGG CCGCTCACCG GGGGTGGTTT CGACATTATA AGTCCTGAGC TGTTTTTCGC TTACGCACCC AATCCCGAGG ATATACATGC TGCCCACAGC ATTTATTTTC AGGCGCTCGG TGAGCACGGT TTCGTGGGGC TGGGGCTTTA CTTGCTGCTC GGGTGGCTCA CCTGGCGAAC CGGGTCATGG ATCATACGCA ACACCAGTAA GCTCGACGAG TACAGGTGGG CTTTCAATCT GGCGACGATG ATTCAGGTAT CGTTCATCGG TTTTGCTACG GGAGGTGCTT TCCTTAGTCT TCTTTATTTT GACGTTCCCT ACTATCTCAT GGGGGCAATG GTTGCAACGC GTGTCTTGGT GGAAAAAGAA CTCAAGCAAA AAACCATGTC GGCAATCGCT GCAAAAACCT CCGGTTCTTC GCGCCAGCTT GGACAGTTGA GTCCATCACA GTCGCAGCCG ATAGCCGGGG ATTCAAGCTA G
|
Protein sequence | MRDILVTLIV FGWLPFVFKK PYMGALMWVW ISVMNPHTQG WGFATTFPFA AIIAGVTITS LLITKEPKNL PLTPVTWTFI IFVLWMNLST VFAIYPDETY IQWNKVMKIM LMSFVVIMLI RTQKQIQLLI WTIVISLGYY GIKGGVFTLL GGGVDLVMGP ESTFIEGNNE IALALIMTIP LMHYVQMISR KTWVKHAMTA AMLLSAFAAL GSYSRGALLA IAAMGGFLWL KSQHKGRIAL LVILAILPAI AYMPEQWMQR MDSINTYEED ASVQGRFNAW WMAYNLAKDR PLTGGGFDII SPELFFAYAP NPEDIHAAHS IYFQALGEHG FVGLGLYLLL GWLTWRTGSW IIRNTSKLDE YRWAFNLATM IQVSFIGFAT GGAFLSLLYF DVPYYLMGAM VATRVLVEKE LKQKTMSAIA AKTSGSSRQL GQLSPSQSQP IAGDSS
|
| |