Gene Information |
Locus tag | Nmul_A2528 |
Symbol | |
ID | 3784033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2893941 |
End bp | 2895473 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637812619 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_413209 |
Protein GI | 82703643 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily |
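The coordinate and length fields above can be checked against each other with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it reproduces the 1533 bp listed) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information table.
start_bp = 2893941
end_bp = 2895473

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last triplet is assumed to be the stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length:    {gene_length} bp")
print(f"Codons:         {codon_count}")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the result agrees with the listed Gene Length (1533 bp) and Protein Length (510 aa). The strand field does not affect the calculation, since the length is the same on either strand.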
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.597062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGAGC TGACAACCCA ACTGCTGGTT TACCTGAAAG GGATATGGAA ATATCGTTGG GCTGCTGTGG CGGCAGCCTG GGTCGTCGCG GTGATCGGCT GGGTCATCGT TTATAAACTC CCTGATGATT ACCAGGCGTC CGCCAGGATT TATGTCGATA CGCAAAATGT GTTGAAGCCG TTGTTGCAGG GCATGACGGT TTCCCCAGAT ACGCAGCAAC AGATTTCGAT CATGAGCCGT ACCCTGATCA GCCGTCCCAA TGTGGAGAGA GTCATTCGCA TGGTGGATCT GGATATCAAG GCAAAAGACA CCAAGGATCA GGAAAGGCTC GTGAAGGAGC TCATGGATAA AATCAAACTG GGGACCACGG GACGGGATAA CCTCTTCACT ATTTCCTATA ACAACCAGAA TCCAAGGCTC GCCAAAGAGA TTGTCCAATC TTTATTGACC CTTTTTGTTG AAGGGGGACT TGGGAACAAG AGCCAGGATT CTTCATCGGC CATACGCTTC ATTGATGAGC AAATCAAATC CTATGAGGAG AAGCTGATCG CAGGGGAAAA TAATCTCAAG GCATTCAAAC AGAAAAACAT CGGAATAATG CCGCAACAAG GCAATGATTA CTATTCACAG TTGTCGCAGG CGATGGACGA TCTCAATAAA ACCAAGCTGG AACTGCGCGA GGCCCAGCAG GCGAGAGATG CGATCAAACG CCAGATCACC GGTGATGAAC CTGTTCTCCT CGTGGATCAG GGTGAAAGCG GATCGGCTTC ATCGATAGTC AATGTGGAAC TTGATTCTCG AATCCAGGCT TTGAACAAGA ATCTCGACGC ACTCAGGCTA AACTACACGG AGTTGCACCC GGATATCATT GCAGCGAAAC GGCTCATTGC CCAGCTTGAG GAACGCAAGA TCGAAGAGGC CAAACTGACC AGGAACGGCT CGGATCCGGG CAAAAACTAT AGTCCGATGT TGCAGCAACT CAATGTGGCA CTGGCGGATG CGGAAGCTGA CGTGGCATCC ATGAATGCCC GGGTGGAAGA ATATAGCGCT CGTTACGAGC GCCTCAAGTC CTTGAGCAAT GCAGTGCCGC AGGTTGAAGC TGAACTGGCC CAGCTCAACC GGGATTATCA GGTAAACAAG GCAAACTACG AAAAACTTCT CGAGCGGCGC GAATCAGCCA AAATATCGGG AGATCTGGGT TCCACCACAG ACCTGGTTTC GTTCCGTGTC ATTGATCCCC CGACAGTTTC TGACAGGCCC GTTGGCCCGG ACCGGGGAAA ATTCTTCTCC ATCATTTTCC TGGGTTCTTT GCTGGCGGGG ATCGGTATAG CCTTCGTTAT CAGCCAGGTT AGGCCTACCT TCCACAGCCA GACCAGTCTG CGGGAAATTT CGGGTAAGCC GATACTGGGG TCAATTCCGA TGATCTGGAC AGATAAGGAA AAGGTAAAGC GCAGAAAGCG CCTCTATGCA TTCGGATTAT CCTTGCTGTC CTTGTTAGGC CTGTATGGCA TCCTTATGCT GAAGATAGCG TGA
|
Protein sequence | MEELTTQLLV YLKGIWKYRW AAVAAAWVVA VIGWVIVYKL PDDYQASARI YVDTQNVLKP LLQGMTVSPD TQQQISIMSR TLISRPNVER VIRMVDLDIK AKDTKDQERL VKELMDKIKL GTTGRDNLFT ISYNNQNPRL AKEIVQSLLT LFVEGGLGNK SQDSSSAIRF IDEQIKSYEE KLIAGENNLK AFKQKNIGIM PQQGNDYYSQ LSQAMDDLNK TKLELREAQQ ARDAIKRQIT GDEPVLLVDQ GESGSASSIV NVELDSRIQA LNKNLDALRL NYTELHPDII AAKRLIAQLE ERKIEEAKLT RNGSDPGKNY SPMLQQLNVA LADAEADVAS MNARVEEYSA RYERLKSLSN AVPQVEAELA QLNRDYQVNK ANYEKLLERR ESAKISGDLG STTDLVSFRV IDPPTVSDRP VGPDRGKFFS IIFLGSLLAG IGIAFVISQV RPTFHSQTSL REISGKPILG SIPMIWTDKE KVKRRKRLYA FGLSLLSLLG LYGILMLKIA
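As a spot check on how the gene sequence relates to the protein sequence, the first three 10-base blocks can be translated and compared with the first ten residues. This is a minimal sketch, not the database's own pipeline: the `translate` helper and the table construction are hypothetical. It relies on the fact that translation table 11 assigns the same amino acid to every codon as the standard code and differs only in which start codons are permitted. Only the opening blocks and the final codon are used here, not the full sequence.

```python
# Codon table built in NCBI order (bases T, C, A, G); the amino-acid
# assignments are the same for the standard code and for table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring the spaces between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, as printed in the record.
first_blocks = "ATGGAGGAGC TGACAACCCA ACTGCTGGTT"
n_terminus = translate(first_blocks)

# The gene sequence ends in TGA, which translates to a stop ('*').
last_codon = translate("TGA")

print(n_terminus)   # compare with the first block of the protein sequence
print(last_codon)
```

The translated prefix is `MEELTTQLLV`, matching the first block of the protein sequence, and the terminal `TGA` is a stop codon. The sequence as listed is therefore already in coding orientation, even though the gene lies on the minus strand of the replicon.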