Gene Nmul_A2520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2520 
Symbol 
ID3786645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2884056 
End bp2885261 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content53% 
IMG OID637812611 
Productglycosyl transferase, group 1 
Protein accessionYP_413201 
Protein GI82703635 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03088] sugar transferase, PEP-CTERM/EpsH1 system associated 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.303259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCCAA CCGTTAGTGA CAGGCTGCGG TTGAGTCCCG CGGAACACGT CAAGCAGCCG 
CCTCTCATTG CTCATGTGAT TTATCATCTG GGGGTGGGGG GACTGGAAAA TGGCGTAGTC
AATCTCATCA ATCATATTCC ACCCGATCGC TACCGGCATG CCATAGTTTG CCTGAAGGGG
TATTCGGACT TCAGGAGCAG GATTGTCAGC GAAGACGTGG AGGTTATCGC GTTGAACAAG
CGCGATGGAC ATGATTTCAA GCTATATATC GATCTTTTCA GAGCGCTCAG GCGCTTGAAG
CCCGACATCG TGCATACCCG TAATCTGGCT GCCATGGAAG GTCAGGTGAT TGCAGCTCTT
GCGGGGGCGC GGGCAAGAGT CCATGGCGAG CATGGGCGGG ATATGTTCGA CCTGCATGGT
AAAAACCGTA AATATAATTT ATTGAGAAAA GCGATTCGTC CGTTTATAAA CCATTTCATC
ACCGTCAGCA GGGATCTCGA AAGCTGGCTT GTCGATACGG TACGGGCAGC GCCGCATCGC
ATCAATCAAA TCTACAATGG CGTAGACAGC CGACGCTTTT ATCCGCGTAA AAGCACATCC
TTGAAGAACA ATAGGGTTCA GGGAGCGATT CCGGGATTTT TCAGGGAAGA TGCCTTTGTC
ATTGGCAGTG TCGGCCGCAT GGCAGATGTG AAGAATTACC TCGGCCTGAT AGAAGCATTT
TTACTTTTGC TGAAGGAAAT GCCTGCGGCT CACGAAAGAC TTCGGTTATT GATTGTCGGG
GCGGGGAGTA CCCGGCAGCG CTGCATTGAA AAGGTGCGTG AAGCGGGAAT CGAAGGACTT
GTCTGGTTTC CCGGTGAACG GGACGACATT CCTGAACTCA TGCGCAGCAT GGATCTGTTT
GTGCTTCCTT CGCTTGGAGA GGGCATTTCC AACACCATTC TCGAGGCTAT GTCTACCGGC
TTGCCCGTCG TCGCCACCCG GGTGGGAGGA AATGCGGAAC TGGTTGAGGA AGGCATGACA
GGAATGCTGG TTCCGCCGGG ATCGGCAACT GCGCTGGCAG GAGCCATACA GGAGTATTAC
AGAAATCCGG AGCTGTTGAT AGAACACGGC CGCGCTGCCC GAAAGCAGGT CGAGGCAAGG
TTCAGTATGG AAGCCATGAT GTCCGGATAT CTTGAAGTCT ATGACCGAGT GTTACGTAGG
GTATAA
 
Protein sequence
MQPTVSDRLR LSPAEHVKQP PLIAHVIYHL GVGGLENGVV NLINHIPPDR YRHAIVCLKG 
YSDFRSRIVS EDVEVIALNK RDGHDFKLYI DLFRALRRLK PDIVHTRNLA AMEGQVIAAL
AGARARVHGE HGRDMFDLHG KNRKYNLLRK AIRPFINHFI TVSRDLESWL VDTVRAAPHR
INQIYNGVDS RRFYPRKSTS LKNNRVQGAI PGFFREDAFV IGSVGRMADV KNYLGLIEAF
LLLLKEMPAA HERLRLLIVG AGSTRQRCIE KVREAGIEGL VWFPGERDDI PELMRSMDLF
VLPSLGEGIS NTILEAMSTG LPVVATRVGG NAELVEEGMT GMLVPPGSAT ALAGAIQEYY
RNPELLIEHG RAARKQVEAR FSMEAMMSGY LEVYDRVLRR V