Gene Nmul_A2528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2528 
Symbol 
ID3784033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2893941 
End bp2895473 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content51% 
IMG OID637812619 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_413209 
Protein GI82703643 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.597062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGAGC TGACAACCCA ACTGCTGGTT TACCTGAAAG GGATATGGAA ATATCGTTGG 
GCTGCTGTGG CGGCAGCCTG GGTCGTCGCG GTGATCGGCT GGGTCATCGT TTATAAACTC
CCTGATGATT ACCAGGCGTC CGCCAGGATT TATGTCGATA CGCAAAATGT GTTGAAGCCG
TTGTTGCAGG GCATGACGGT TTCCCCAGAT ACGCAGCAAC AGATTTCGAT CATGAGCCGT
ACCCTGATCA GCCGTCCCAA TGTGGAGAGA GTCATTCGCA TGGTGGATCT GGATATCAAG
GCAAAAGACA CCAAGGATCA GGAAAGGCTC GTGAAGGAGC TCATGGATAA AATCAAACTG
GGGACCACGG GACGGGATAA CCTCTTCACT ATTTCCTATA ACAACCAGAA TCCAAGGCTC
GCCAAAGAGA TTGTCCAATC TTTATTGACC CTTTTTGTTG AAGGGGGACT TGGGAACAAG
AGCCAGGATT CTTCATCGGC CATACGCTTC ATTGATGAGC AAATCAAATC CTATGAGGAG
AAGCTGATCG CAGGGGAAAA TAATCTCAAG GCATTCAAAC AGAAAAACAT CGGAATAATG
CCGCAACAAG GCAATGATTA CTATTCACAG TTGTCGCAGG CGATGGACGA TCTCAATAAA
ACCAAGCTGG AACTGCGCGA GGCCCAGCAG GCGAGAGATG CGATCAAACG CCAGATCACC
GGTGATGAAC CTGTTCTCCT CGTGGATCAG GGTGAAAGCG GATCGGCTTC ATCGATAGTC
AATGTGGAAC TTGATTCTCG AATCCAGGCT TTGAACAAGA ATCTCGACGC ACTCAGGCTA
AACTACACGG AGTTGCACCC GGATATCATT GCAGCGAAAC GGCTCATTGC CCAGCTTGAG
GAACGCAAGA TCGAAGAGGC CAAACTGACC AGGAACGGCT CGGATCCGGG CAAAAACTAT
AGTCCGATGT TGCAGCAACT CAATGTGGCA CTGGCGGATG CGGAAGCTGA CGTGGCATCC
ATGAATGCCC GGGTGGAAGA ATATAGCGCT CGTTACGAGC GCCTCAAGTC CTTGAGCAAT
GCAGTGCCGC AGGTTGAAGC TGAACTGGCC CAGCTCAACC GGGATTATCA GGTAAACAAG
GCAAACTACG AAAAACTTCT CGAGCGGCGC GAATCAGCCA AAATATCGGG AGATCTGGGT
TCCACCACAG ACCTGGTTTC GTTCCGTGTC ATTGATCCCC CGACAGTTTC TGACAGGCCC
GTTGGCCCGG ACCGGGGAAA ATTCTTCTCC ATCATTTTCC TGGGTTCTTT GCTGGCGGGG
ATCGGTATAG CCTTCGTTAT CAGCCAGGTT AGGCCTACCT TCCACAGCCA GACCAGTCTG
CGGGAAATTT CGGGTAAGCC GATACTGGGG TCAATTCCGA TGATCTGGAC AGATAAGGAA
AAGGTAAAGC GCAGAAAGCG CCTCTATGCA TTCGGATTAT CCTTGCTGTC CTTGTTAGGC
CTGTATGGCA TCCTTATGCT GAAGATAGCG TGA
 
Protein sequence
MEELTTQLLV YLKGIWKYRW AAVAAAWVVA VIGWVIVYKL PDDYQASARI YVDTQNVLKP 
LLQGMTVSPD TQQQISIMSR TLISRPNVER VIRMVDLDIK AKDTKDQERL VKELMDKIKL
GTTGRDNLFT ISYNNQNPRL AKEIVQSLLT LFVEGGLGNK SQDSSSAIRF IDEQIKSYEE
KLIAGENNLK AFKQKNIGIM PQQGNDYYSQ LSQAMDDLNK TKLELREAQQ ARDAIKRQIT
GDEPVLLVDQ GESGSASSIV NVELDSRIQA LNKNLDALRL NYTELHPDII AAKRLIAQLE
ERKIEEAKLT RNGSDPGKNY SPMLQQLNVA LADAEADVAS MNARVEEYSA RYERLKSLSN
AVPQVEAELA QLNRDYQVNK ANYEKLLERR ESAKISGDLG STTDLVSFRV IDPPTVSDRP
VGPDRGKFFS IIFLGSLLAG IGIAFVISQV RPTFHSQTSL REISGKPILG SIPMIWTDKE
KVKRRKRLYA FGLSLLSLLG LYGILMLKIA