Gene Information |
Locus tag | Nmul_A2522 |
Symbol | |
ID | 3786647 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2886889 |
End bp | 2888094 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637812613 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_413203 |
Protein GI | 82703637 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | [TIGR03087] sugar transferase, PEP-CTERM/EpsH1 system associated |
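The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 2886889
END_BP = 2888094
GENE_LENGTH_BP = 1206
PROTEIN_LENGTH_AA = 401


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span is 1206 bp, which corresponds to 402 codons, or 401 amino acids plus one stop codon.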
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.165629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGAAT TGCTTTATCT GGTCCATCGA ATTCCTTATC CCCCGAATAA GGGGGACAAG ATCCGGTCCT ATCATCTGCT GGAACATCTG GCACAACACC ACCGCGTCCA TCTTGGCACC TTTATCGACG ACGAGGGGGA CTGGAAATAT ATCGAGAAGG TAAGAAGCCT CTGCGGGGAA ACCTGTTTCA TCAACCTCCA TCCAGGGATG GCCAGGGGGC GCAGCCTTTC GGGATTGCTT TGCGGGAAGC CCCTTACCCT TCCCTACTAC TGGAACCCGC GCCTGCAAGC ATGGGTAAAT CATGTACTTG GCACAAGACC TGTTGAGAAT ATCCTGATCT TTTCCTCAGG AATGGCGCAA TATGTAAGTC GGGTGCAGCA TATCCGCCGG ATTATCGATT TTGTCGACAT TGATTCGGAC AAGTGGATGC AATATTCGAC ATCAGCAGGC TGGCCGATGA ACTGGATATA CCGAAGAGAA TCGAGATTGC TGCTGGGCTA TGAAAAGGAG ATCGCGCGCG CATTCGACAG CGCCACTTTC GTCTCCGAGA CAGAAGCCGA CCTGTTTCAC AGATTGCTAC CCGAAGCTGC TGCAAAAGTG ACTCACTTCA ATAACGGCGT CGATGCCAAT TATTTTTCAC CGCAAAATAG CTATCCCAAT CCGTATCCGG AGGGGAAGCG CATTCTTGTG TTCACTGGCG CGATGGATTA TCGGGCCAAT GTTGACGCAG TTGCCTGGTT TGCAAGGGCT GTTTTTCCGG CGATCCGCGC GAAGCTACCG GAAGTCGAAT TCTATATTGT CGGAGCACGC CCATCCGATA CCGTAGCAGC TCTCTCGGCA TTTCCGGGCA TCAGGGTAAC GGGGTTCGTG TCCGACATCC GGCCCTATCT GGCGCATGCC TCGCTGGTGG TTGCGCCGCT ACGCATTGCG CGTGGAATAC AGAACAAAGT CCTGGAGGCA ATGGCCATGG AGAAAATCGT CATAGCTTCT CCGTCGGCTG CGGAAGGCAT TCGCGCGCGA AGGGAGGAGG AACTGGTGGT TGCCCTTGAT GAACAGGACT TCGCCCATCG GGTTATCTCC TTTCTCCAGA ACGGCGAGCA TCCTGGAATC TGTCGAGCTG CAAGAGTGCG CGTGCTGGAA GATTACAGCT GGAAAAATGG TTTGGGACGT ATCGACAGAC TCCTGTCACA ACCCCAGGCA GTTTAG
Protein sequence | MRELLYLVHR IPYPPNKGDK IRSYHLLEHL AQHHRVHLGT FIDDEGDWKY IEKVRSLCGE TCFINLHPGM ARGRSLSGLL CGKPLTLPYY WNPRLQAWVN HVLGTRPVEN ILIFSSGMAQ YVSRVQHIRR IIDFVDIDSD KWMQYSTSAG WPMNWIYRRE SRLLLGYEKE IARAFDSATF VSETEADLFH RLLPEAAAKV THFNNGVDAN YFSPQNSYPN PYPEGKRILV FTGAMDYRAN VDAVAWFARA VFPAIRAKLP EVEFYIVGAR PSDTVAALSA FPGIRVTGFV SDIRPYLAHA SLVVAPLRIA RGIQNKVLEA MAMEKIVIAS PSAAEGIRAR REEELVVALD EQDFAHRVIS FLQNGEHPGI CRAARVRVLE DYSWKNGLGR IDRLLSQPQA V
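Both sequences above are displayed in space-separated blocks of ten characters, so the spaces have to be removed before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names are hypothetical, and the GC calculation assumes the sequence contains only unambiguous A, C, G and T bases.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(displayed.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First two blocks of the gene sequence above, used as a short example.
example = "ATGCGCGAAT TGCTTTATCT"
dna = clean_sequence(example)
print(len(dna), gc_fraction(dna))
```

Applying the same two functions to the full gene sequence gives a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.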