Gene Nmul_A2522 details

Gene Information

Locus tag: Nmul_A2522
Symbol:
ID: 3786647
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2886889
End bp: 2888094
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 54%
IMG OID: 637812613
Product: glycosyl transferase, group 1
Protein accession: YP_413203
Protein GI: 82703637
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID: [TIGR03087] sugar transferase, PEP-CTERM/EpsH1 system associated
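
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2886889
end_bp = 2888094

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the reported Gene Length (1206 bp) and Protein Length (401 aa).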


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.165629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGAAT TGCTTTATCT GGTCCATCGA ATTCCTTATC CCCCGAATAA GGGGGACAAG 
ATCCGGTCCT ATCATCTGCT GGAACATCTG GCACAACACC ACCGCGTCCA TCTTGGCACC
TTTATCGACG ACGAGGGGGA CTGGAAATAT ATCGAGAAGG TAAGAAGCCT CTGCGGGGAA
ACCTGTTTCA TCAACCTCCA TCCAGGGATG GCCAGGGGGC GCAGCCTTTC GGGATTGCTT
TGCGGGAAGC CCCTTACCCT TCCCTACTAC TGGAACCCGC GCCTGCAAGC ATGGGTAAAT
CATGTACTTG GCACAAGACC TGTTGAGAAT ATCCTGATCT TTTCCTCAGG AATGGCGCAA
TATGTAAGTC GGGTGCAGCA TATCCGCCGG ATTATCGATT TTGTCGACAT TGATTCGGAC
AAGTGGATGC AATATTCGAC ATCAGCAGGC TGGCCGATGA ACTGGATATA CCGAAGAGAA
TCGAGATTGC TGCTGGGCTA TGAAAAGGAG ATCGCGCGCG CATTCGACAG CGCCACTTTC
GTCTCCGAGA CAGAAGCCGA CCTGTTTCAC AGATTGCTAC CCGAAGCTGC TGCAAAAGTG
ACTCACTTCA ATAACGGCGT CGATGCCAAT TATTTTTCAC CGCAAAATAG CTATCCCAAT
CCGTATCCGG AGGGGAAGCG CATTCTTGTG TTCACTGGCG CGATGGATTA TCGGGCCAAT
GTTGACGCAG TTGCCTGGTT TGCAAGGGCT GTTTTTCCGG CGATCCGCGC GAAGCTACCG
GAAGTCGAAT TCTATATTGT CGGAGCACGC CCATCCGATA CCGTAGCAGC TCTCTCGGCA
TTTCCGGGCA TCAGGGTAAC GGGGTTCGTG TCCGACATCC GGCCCTATCT GGCGCATGCC
TCGCTGGTGG TTGCGCCGCT ACGCATTGCG CGTGGAATAC AGAACAAAGT CCTGGAGGCA
ATGGCCATGG AGAAAATCGT CATAGCTTCT CCGTCGGCTG CGGAAGGCAT TCGCGCGCGA
AGGGAGGAGG AACTGGTGGT TGCCCTTGAT GAACAGGACT TCGCCCATCG GGTTATCTCC
TTTCTCCAGA ACGGCGAGCA TCCTGGAATC TGTCGAGCTG CAAGAGTGCG CGTGCTGGAA
GATTACAGCT GGAAAAATGG TTTGGGACGT ATCGACAGAC TCCTGTCACA ACCCCAGGCA
GTTTAG
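
The GC content reported in the Gene Information table can be recomputed from a sequence in this space-separated layout. The following is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases; `gc_percent` is a hypothetical helper name, and the database's exact rounding rule is not stated in the record.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base column spacing and line breaks used in
    this record) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Demonstration on the final row of the gene sequence only;
# paste the full sequence to compare against the reported value.
last_row = "GTTTAG"
print(round(gc_percent(last_row), 1))
```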
 
Protein sequence
MRELLYLVHR IPYPPNKGDK IRSYHLLEHL AQHHRVHLGT FIDDEGDWKY IEKVRSLCGE 
TCFINLHPGM ARGRSLSGLL CGKPLTLPYY WNPRLQAWVN HVLGTRPVEN ILIFSSGMAQ
YVSRVQHIRR IIDFVDIDSD KWMQYSTSAG WPMNWIYRRE SRLLLGYEKE IARAFDSATF
VSETEADLFH RLLPEAAAKV THFNNGVDAN YFSPQNSYPN PYPEGKRILV FTGAMDYRAN
VDAVAWFARA VFPAIRAKLP EVEFYIVGAR PSDTVAALSA FPGIRVTGFV SDIRPYLAHA
SLVVAPLRIA RGIQNKVLEA MAMEKIVIAS PSAAEGIRAR REEELVVALD EQDFAHRVIS
FLQNGEHPGI CRAARVRVLE DYSWKNGLGR IDRLLSQPQA V
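
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is one way to do that under stated assumptions: it builds the codon table from the standard genetic code, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in its permitted start codons), and it stops at the first in-frame stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop column spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as printed in this record.
first_row = "ATGCGCGAAT TGCTTTATCT GGTCCATCGA ATTCCTTATC CCCCGAATAA GGGGGACAAG"
print(translate(first_row))  # MRELLYLVHRIPYPPNKGDK
```

The output corresponds to the first 20 residues of the protein sequence above (MRELLYLVHR IPYPPNKGDK).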