Gene Nmul_A1783 details


Gene Information

Locus tag: Nmul_A1783
Symbol:
ID: 3784361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2036280
End bp: 2037287
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 56%
IMG OID: 637811869
Product: glycosyl transferase family protein
Protein accession: YP_412472
Protein GI: 82702906
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID:
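
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence listed further down ends in TAA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2036280
end_bp = 2037287
gene_length_bp = 1008
protein_length_aa = 335


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions the 1008 bp span corresponds to 336 codons, of which 335 encode amino acids and one is the stop codon.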


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTCAGA TGGGTAATCT GCCAATCAAG AAGATCGCGA TTTTCAGGGC CTTGAAGCTG 
GGGGATTTTC TGTTGTTCGT CCCTGCGCTG CGGGCGATTA GGCGGGCTTT TCCGCAAGCC
ACCATCGATT ATGTCGGGTT GCCCTGGAAC AAGGCGCTCG CCGCGCGTTA CAATCATTAT
ATCGACGAAT TTATCGAATT TCCGGGTTTT CCCGGATTGC CCGAGCATCC CTTCAGGGCT
GAAGCAGTCA CGGCTTTTCT GGACGGCATG CAGCGGCGAC AATACGACCT TGCCCTGCAA
ATGCATGGCA AAGGTACAGT ATCTAATCTT GTCGTCTCTC TTTTCGGGGC GGCTATTGCG
GCCGGGTTTG CGAGCGAAGG CAACTCTCAC TGGCCTAACC GCGATTTCTT CATGCCATAC
CCCTCCAGGC AGCCTGAGTT GCTAAGGAAT CTGGCGTTAC TCGAGTTTCT CGGCATGGAG
CAGGCTGATC GCGCGGCGGA CAGAACCATG GAATTTCCGT TATTGGACAT GGACTGCCAG
AAACTTCGGG AACTGCAAGA ATATGGGACT ATTCGTGATA AACCTTATGT CTGCCTGCAT
CCCGGCGCGA TTTCCGCAAC CCCCTGGCCT GCTGCTCATT TCGCGGAGGT GGCGGACAGG
TGTATTCGGC AGGGTTTGAA AGTGGTGTTG ACGGGTACTG CGGAAGAGAA GCCGCTCACG
CAAGCGGTCG CCGGGAAAAT GACGGGTACG GCGATTGATC TTGCCGGTAA AACCGCCATC
GGGGCACTCG CCGCCCTTCT GAAGGGCAGC CGGGCGGTGA TCTCGAATGA CACGGGAGTT
GCGCATTTGG CGGTAGCGGT CGATGCCCCG AGTGTCACCG TCTTTACCAC GACCGATCCG
CTGATTTGGG GTCCGTTGGA TCAGGTTCAT CATCGGGTTG TTTCGGGAAA CGACGTGAAG
ACGCCGGAAA TGGCAATACG GGCGCTGGAG GAATTAATTG GGCGTTAA
 
Protein sequence
MAQMGNLPIK KIAIFRALKL GDFLLFVPAL RAIRRAFPQA TIDYVGLPWN KALAARYNHY 
IDEFIEFPGF PGLPEHPFRA EAVTAFLDGM QRRQYDLALQ MHGKGTVSNL VVSLFGAAIA
AGFASEGNSH WPNRDFFMPY PSRQPELLRN LALLEFLGME QADRAADRTM EFPLLDMDCQ
KLRELQEYGT IRDKPYVCLH PGAISATPWP AAHFAEVADR CIRQGLKVVL TGTAEEKPLT
QAVAGKMTGT AIDLAGKTAI GALAALLKGS RAVISNDTGV AHLAVAVDAP SVTVFTTTDP
LIWGPLDQVH HRVVSGNDVK TPEMAIRALE ELIGR