Gene Nmul_A1781 details

Gene Information

Locus tag: Nmul_A1781
Symbol:
ID: 3784359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2034134
End bp: 2035276
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 58%
IMG OID: 637811867
Product: glycosyl transferase family protein
Protein accession: YP_412470
Protein GI: 82702904
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID:
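
The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, but both agree with the reported values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2034134
end_bp = 2035276

def span_length(start, end):
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1

def coding_codons(length_bp):
    """Number of codons, excluding one terminal stop codon (assumed present)."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3 - 1

computed_length = span_length(start_bp, end_bp)   # compare with Gene Length
computed_protein = coding_codons(computed_length)  # compare with Protein Length
print(computed_length, computed_protein)
```

Under these assumptions the span works out to 1143 bp and 380 codons before the stop, matching the Gene Length and Protein Length entries.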


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGCAGCAC GTAATTTCCC CCCCGTTCTC CCTTTCTCAC CCCGCAAAAT CGCGGTTCTC 
CATGCCAAGG CGGTGGGTGA CTTTATCGTC ATCCTGCCCG CCCTGGATGC GATTAAACGG
ACATATCCCG AGGCAGAACT CATCCTGCTC GCCAAGCCGT GGGTAAAGGA ATTCTTCTCC
GGCCGCCCGT CGCCGGTTGA CCGTGTTCTC AGCCTTCCCC CACTTGCCGG TGTCAATGAT
CCGGTTGAGT CCAAAGGCCG CGTTCCCTCC ACAGGCAGCG GTGTCTACCC TGTCGAAGTA
GAACTGTTCT GCCAAGCGAT GCAAGGCGAG AAGCTCGATG TCGTCATTCA TATGCAAGGG
GATGGTAAAT CCGTCAATCC CTTCATCAAT AAATTTGCCG CCCGCGTGAC GGCAGGCATG
TGCAACCCGC CTGCCGAATC CCTCGACCGT TCCATTCCCT ATGTGCATTA CCAGAGCGAG
GTTCTGCGCA ATCTCGAAGT GGCCTCCTTA ATTGGCGCTC GTACCACGAG TACCGGATTC
GAACCGCGCA TCGAGGTGAC GGAATCCGAC GAACAGGAGA CAGAATCGGT CCGCCAGACG
ATCAAGGGCA AACCCTATGT CGTTATCCAC CCGGGCGCGG ATGATATCCG CAGGGTATGG
CCTGCAGTCA GGTTTGCCGA GGCCGCAGAT TGCCTGCTCG AAAAAGGATA TGCAGTTGTC
GTGACAGGAA CGCCGAAAGA AGAAGAGCGC GTGGCGACTG TCATTCGGGC CATGAGTCGG
CCTGCGATTC CCTGTACCCG ACTTGGCCTG TGCGGTCTGG CTGCCCTCTT ACAGCATAGC
GCGCTCGTCA TCAGCAACGA TACCGGCCCG CTCCATCTGG CACGCGCGGT GGGCGCACGT
ACAGTGGGAA TTTACTGGGC CCCCAACATC CTCAACTGGG GTCCGCTCAG CCGCGACCGG
CACCGGCTCG CGATCGGTTG GCAGCTTGAA TGTCCCCAAT GCGGCATCAG GCCTGTTTCG
CCCTGGCCGT TTCAGCCGCA GACATCCGAC TGCAGTCATC CGTACTCATT CGTGGAAAGT
GTACCGGTTG CAGAGGTTCT TGCATTGGCC ACTGAATTAT TGGGCCTGAC GACTGAGCAT
TGA
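
The gene sequence is listed in blocks of 10 bases separated by spaces and line breaks, so it has to be normalized before any computation. The following is a minimal sketch of how a GC fraction could be computed from such a listing; `gc_content` is a hypothetical helper, and how the database derived its own GC content figure is not stated in the record.

```python
def gc_content(listing):
    """Return the fraction of G and C bases in a whitespace-formatted listing."""
    # Remove the spaces and newlines that separate the 10-base blocks.
    seq = "".join(listing.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)

# Example on the first two blocks of the listing only, not the whole gene.
example = gc_content("TTGGCAGCAC GTAATTTCCC")
print(example)
```

Passing the full listing to the same function gives the fraction for the whole gene, which can then be compared with the GC content entry.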
 
Protein sequence
MAARNFPPVL PFSPRKIAVL HAKAVGDFIV ILPALDAIKR TYPEAELILL AKPWVKEFFS 
GRPSPVDRVL SLPPLAGVND PVESKGRVPS TGSGVYPVEV ELFCQAMQGE KLDVVIHMQG
DGKSVNPFIN KFAARVTAGM CNPPAESLDR SIPYVHYQSE VLRNLEVASL IGARTTSTGF
EPRIEVTESD EQETESVRQT IKGKPYVVIH PGADDIRRVW PAVRFAEAAD CLLEKGYAVV
VTGTPKEEER VATVIRAMSR PAIPCTRLGL CGLAALLQHS ALVISNDTGP LHLARAVGAR
TVGIYWAPNI LNWGPLSRDR HRLAIGWQLE CPQCGIRPVS PWPFQPQTSD CSHPYSFVES
VPVAEVLALA TELLGLTTEH
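
The gene sequence begins with TTG while the protein sequence begins with M, which is consistent with translation table 11 treating TTG as an alternative start codon. The sketch below illustrates this under stated assumptions: the codon table is built from the standard compact amino-acid string in TCAG order, the set of alternative start codons is taken from the NCBI definition of table 11, and `translate` is an illustrative helper rather than the tool the database used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: initiation codons listed for NCBI translation table 11.
ALT_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}

def translate(listing):
    """Translate a whitespace-formatted DNA listing, stopping at the first stop codon."""
    seq = "".join(listing.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"  # any recognized start codon initiates with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First six codons of the gene sequence, followed by a stop codon.
print(translate("TTGGCAGCAC GTAATTTC TGA"))
```

The first six codons translate to MAARNF, matching the start of the listed protein sequence.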