Gene Nmul_A2399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2399 
Symbol 
ID3786180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2732866 
End bp2735856 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content48% 
IMG OID637812488 
Productglycosyl transferase, group 1 
Protein accessionYP_413080 
Protein GI82703514 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.969384 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAACC TCTCGCCAAC TTATGTATTC GACATCCAGG AACAACTCTG GTCTCGTCCA 
GGGTATGGCG GTATTAACTA CAGCGATGGG GATGACATCG AGTTGCAGAT AGCGAAAGTG
ATCGCGGAGG CGGCAGATAT TACTGTCCTT TCAAGGGAAC TGCGACAACA TTGCACTGAT
TGGCCTTTTC TCTACCATTT GTCCGCGAGC CGTGCGAATA TCTTGCGCCC ATTTGCAACC
ACTCTAACCG GCGACATTCT GGAAATTGGC GCGGGTTGTG GCGCAATCAC ACGCTATCTC
GGTGAGGCTG GTGCAAATAT CCTGGCCTTA GAGGGAAGCC CAAGACGGGC CGCTATTGCT
CGTTCCAGAA CCCGGGATCT CGAAAACGTA ACGGTACTGG CGGAAAAATT CGACCAGTTT
GAATGTGACC ATCAATTTGA TTTAATCACT CTCATTGGCG TACTTGAGTA CGCCAATCTC
TTCACGACTG GCGAGAATCC AGCCCTTGCC ATGCTCCAGC GCGTCCGGTC TTTACTCAAG
CCGGAAGGCA CACTGATACT CGCCATCGAA AACCAGTTGG GCTTGAAATA TTTTGCTGGC
GCATTGGAAG ATCATATCGG CCAGCCCATG TACGGGATCG AGGGGCGATA CACCCGGAAT
CAACCCCAAA CATTTGGTCG AATGGTTCTG TCGAATCTGA TGAAAGAAGC CGGTTTCACC
ACCATCGAGT TTTTGGCTCC ATTTCCGGAT TACAAGCTGC CCGTTTCGAT TTTGACTGAA
GAAGGCCTTT CTAGTAAGGA GTTTGATGGT GCGGCATTGG CTTGGCAAAG CGTAAGGCGA
GATCACCAAC TCCCCTTAAG CACGAACTTT TCCATAGAAA TGGTTTGGCT CGAGGTATTC
AAGAACGGAT TGGCTTTGGA TATGGCTAAT TCTTTCTTGC TTGCAGCATC GCCGCTTCAA
CAAAAGATCG TTAAAGCGGG CATTCTAGGG TACCACTACA GTACTGATAG GATCCCTCAA
TATTGCAAAG AAACGATTTT TCAGCGTATT GAAAATAGCA CCACGGTCAA TTATCGGATG
TTAGGTAGTA GTCTTTTGGA TAATCGGACA GACCAAGTCA TTAAATTCGC ATGTCCGGAC
AGAGCAATCT ATGCAGAAGG ATTTCCCTTG TCTCTAGAGT TTACACGACT GCTTACGAGC
GACGGCTGGT CAATGAGCGG GGTTGGACTC CTTATTCACC GCTATATTGG TTTGCTCGAA
TTCATTGCGA ATCAAAAAGG GCACGCGTGG GACAAACCTT TACCACATAG GTTGCCCGGC
GAATTTTTCG ACATGGTCCC ACAAAATATC ATTATCGATA AAAATGGAGA ACCTGTTTCC
ATAGATACAG AATGGGTATT GAAAAATGGT ATTGAGGCTG GCTGGCTTCT ATTTCGCTCT
TTGCTGTTGA CGATTGATTC GATCATGTGT TTTGGCAGAA ATTTAGCAGA ACAAAAATTT
TCACGAAGGG AATTTATCAT ATCAGCCCTC GATGCAGCAG GATTCGCACT CACCAATGAA
GATTTCCAGC GATTTATTGA ATTGGAGGCC GATGTACAGC AGGAGGTAAC AGGGCGTCCT
TCGCATGAAT TCGTAAACTG GCGGCCAGAA CACCCATTGC CGACCTATTC TCCGGTTAAG
CATGGCGGAG AAATTGGCGC CTTGCAGAGC GAAGTTGGCG TGGCGTGGCG TGAGATTGAT
ATGATGCGTG ATGAAATCGA TGTGGCAAGG GCGAAAATCG AAATGCTGCA CGAAGAACTC
GATGCGTTAT ATCGCTCAAC TTCTTGGAGG ATTACCGCGC CTCTTCGGGG TATGCGGCGA
CTTATTACCC GATTCGCAGA CAGACCGAGT CCTTTAAGTC TAGCCCGTGC TACTCTGAAA
CACGGAACGG AATGCTATCA GACAAAGTTG CTAGCTGATC CACAAGTGTC GAGGCAACGC
ATCGTACACG TAATTGGCAA CTTTCTTACT GGCGGTTCGT CCAGACTGGT GGTCGATTTA
TTTGAGCGTC TCGGTCACCT CTATGAACAG GAGGTCGTCA CTCAATATAA CCCTGATCCC
CCTAACTATA CGGGGATTCC GATCCACGAA TTTTCCGGTG GAGACGCTTG GGATAAATTT
ATAACCTACT TGCGATTGTA CCGGCCTGAT CTTGTTCACG TGCACTATTG GGAGGATCCA
CACTGGTACG GAACGATGAT AAAGGCGGCC CGTGAATTCG GCTGCAAGAT AATACAAAAC
GTCAACACTC CAACTGCTCC TTATATGGAC AGTTGTATCA GCCGCTACAT ATATGTCAGT
GACTACGTAA AAACCCGGTT TGGAAAGAGG GATCAGCCGA GCATTACGAT CTACCCGGGC
AGCAACTTCA AATTATTTTC GAGAGATAAA TCTCGACCAG TACCTGACGA TTGCATCGGC
ATGGTTTATA GGCTGGATAT CGATAAACTC CATAGGAAAT CGATTGATGT TTTCATAAAG
GTTGTCCAGA AAAGGCCGCA GACAAAAGTA ATTATTGTTG GCGGGGGGCC CTACCTGGAG
CCTTATAAAG CGGCGGTCAA GGCGGGGAAA GTTGAACATG CTTTTACTTT TACGGGTTTT
GTTCCTTATG AAAGCCTGAT CGGGTTCTAT GCCCGGATGA GCCTTTTTGT GGCTCCCGTC
TGGAAAGAAA GCTTTGGGCA GGTCGGTCCC TTTGCTATGA GCATGGGCTT GCCAGTGGTG
GGCTACAATG TCGGCGCCTT GGCGGAGATC GTGGGCGACT GCGGTCTTCT TGCCCCACGA
GGCAACAGCG AAGCGCTAGC CGAAATCATT ATCGGCCTGC TGGATGATAA GGAGCGCCGT
GAACAAATAG GCTCCCGTAA TCGGGATCGG GCTCACAAAT TATTTTCCGT CGAAAACATG
GTAAATGACT ACCTCAAGCT GTACCAGGAG CTAATAGGCA ACACCCAATG A
 
Protein sequence
MKNLSPTYVF DIQEQLWSRP GYGGINYSDG DDIELQIAKV IAEAADITVL SRELRQHCTD 
WPFLYHLSAS RANILRPFAT TLTGDILEIG AGCGAITRYL GEAGANILAL EGSPRRAAIA
RSRTRDLENV TVLAEKFDQF ECDHQFDLIT LIGVLEYANL FTTGENPALA MLQRVRSLLK
PEGTLILAIE NQLGLKYFAG ALEDHIGQPM YGIEGRYTRN QPQTFGRMVL SNLMKEAGFT
TIEFLAPFPD YKLPVSILTE EGLSSKEFDG AALAWQSVRR DHQLPLSTNF SIEMVWLEVF
KNGLALDMAN SFLLAASPLQ QKIVKAGILG YHYSTDRIPQ YCKETIFQRI ENSTTVNYRM
LGSSLLDNRT DQVIKFACPD RAIYAEGFPL SLEFTRLLTS DGWSMSGVGL LIHRYIGLLE
FIANQKGHAW DKPLPHRLPG EFFDMVPQNI IIDKNGEPVS IDTEWVLKNG IEAGWLLFRS
LLLTIDSIMC FGRNLAEQKF SRREFIISAL DAAGFALTNE DFQRFIELEA DVQQEVTGRP
SHEFVNWRPE HPLPTYSPVK HGGEIGALQS EVGVAWREID MMRDEIDVAR AKIEMLHEEL
DALYRSTSWR ITAPLRGMRR LITRFADRPS PLSLARATLK HGTECYQTKL LADPQVSRQR
IVHVIGNFLT GGSSRLVVDL FERLGHLYEQ EVVTQYNPDP PNYTGIPIHE FSGGDAWDKF
ITYLRLYRPD LVHVHYWEDP HWYGTMIKAA REFGCKIIQN VNTPTAPYMD SCISRYIYVS
DYVKTRFGKR DQPSITIYPG SNFKLFSRDK SRPVPDDCIG MVYRLDIDKL HRKSIDVFIK
VVQKRPQTKV IIVGGGPYLE PYKAAVKAGK VEHAFTFTGF VPYESLIGFY ARMSLFVAPV
WKESFGQVGP FAMSMGLPVV GYNVGALAEI VGDCGLLAPR GNSEALAEII IGLLDDKERR
EQIGSRNRDR AHKLFSVENM VNDYLKLYQE LIGNTQ