Gene Information |
Locus tag | Nmul_A2399 |
Symbol | |
ID | 3786180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 2732866 |
End bp | 2735856 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637812488 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_413080 |
Protein GI | 82703514 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
|
|
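The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2732866
end_bp = 2735856

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons,
# and the final codon is a stop that adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the "Gene Length" and "Protein Length" fields of the record.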
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.969384 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACC TCTCGCCAAC TTATGTATTC GACATCCAGG AACAACTCTG GTCTCGTCCA GGGTATGGCG GTATTAACTA CAGCGATGGG GATGACATCG AGTTGCAGAT AGCGAAAGTG ATCGCGGAGG CGGCAGATAT TACTGTCCTT TCAAGGGAAC TGCGACAACA TTGCACTGAT TGGCCTTTTC TCTACCATTT GTCCGCGAGC CGTGCGAATA TCTTGCGCCC ATTTGCAACC ACTCTAACCG GCGACATTCT GGAAATTGGC GCGGGTTGTG GCGCAATCAC ACGCTATCTC GGTGAGGCTG GTGCAAATAT CCTGGCCTTA GAGGGAAGCC CAAGACGGGC CGCTATTGCT CGTTCCAGAA CCCGGGATCT CGAAAACGTA ACGGTACTGG CGGAAAAATT CGACCAGTTT GAATGTGACC ATCAATTTGA TTTAATCACT CTCATTGGCG TACTTGAGTA CGCCAATCTC TTCACGACTG GCGAGAATCC AGCCCTTGCC ATGCTCCAGC GCGTCCGGTC TTTACTCAAG CCGGAAGGCA CACTGATACT CGCCATCGAA AACCAGTTGG GCTTGAAATA TTTTGCTGGC GCATTGGAAG ATCATATCGG CCAGCCCATG TACGGGATCG AGGGGCGATA CACCCGGAAT CAACCCCAAA CATTTGGTCG AATGGTTCTG TCGAATCTGA TGAAAGAAGC CGGTTTCACC ACCATCGAGT TTTTGGCTCC ATTTCCGGAT TACAAGCTGC CCGTTTCGAT TTTGACTGAA GAAGGCCTTT CTAGTAAGGA GTTTGATGGT GCGGCATTGG CTTGGCAAAG CGTAAGGCGA GATCACCAAC TCCCCTTAAG CACGAACTTT TCCATAGAAA TGGTTTGGCT CGAGGTATTC AAGAACGGAT TGGCTTTGGA TATGGCTAAT TCTTTCTTGC TTGCAGCATC GCCGCTTCAA CAAAAGATCG TTAAAGCGGG CATTCTAGGG TACCACTACA GTACTGATAG GATCCCTCAA TATTGCAAAG AAACGATTTT TCAGCGTATT GAAAATAGCA CCACGGTCAA TTATCGGATG TTAGGTAGTA GTCTTTTGGA TAATCGGACA GACCAAGTCA TTAAATTCGC ATGTCCGGAC AGAGCAATCT ATGCAGAAGG ATTTCCCTTG TCTCTAGAGT TTACACGACT GCTTACGAGC GACGGCTGGT CAATGAGCGG GGTTGGACTC CTTATTCACC GCTATATTGG TTTGCTCGAA TTCATTGCGA ATCAAAAAGG GCACGCGTGG GACAAACCTT TACCACATAG GTTGCCCGGC GAATTTTTCG ACATGGTCCC ACAAAATATC ATTATCGATA AAAATGGAGA ACCTGTTTCC ATAGATACAG AATGGGTATT GAAAAATGGT ATTGAGGCTG GCTGGCTTCT ATTTCGCTCT TTGCTGTTGA CGATTGATTC GATCATGTGT TTTGGCAGAA ATTTAGCAGA ACAAAAATTT TCACGAAGGG AATTTATCAT ATCAGCCCTC GATGCAGCAG GATTCGCACT CACCAATGAA GATTTCCAGC GATTTATTGA ATTGGAGGCC GATGTACAGC AGGAGGTAAC AGGGCGTCCT TCGCATGAAT TCGTAAACTG GCGGCCAGAA CACCCATTGC CGACCTATTC TCCGGTTAAG CATGGCGGAG AAATTGGCGC CTTGCAGAGC GAAGTTGGCG TGGCGTGGCG TGAGATTGAT ATGATGCGTG ATGAAATCGA TGTGGCAAGG GCGAAAATCG AAATGCTGCA CGAAGAACTC 
GATGCGTTAT ATCGCTCAAC TTCTTGGAGG ATTACCGCGC CTCTTCGGGG TATGCGGCGA CTTATTACCC GATTCGCAGA CAGACCGAGT CCTTTAAGTC TAGCCCGTGC TACTCTGAAA CACGGAACGG AATGCTATCA GACAAAGTTG CTAGCTGATC CACAAGTGTC GAGGCAACGC ATCGTACACG TAATTGGCAA CTTTCTTACT GGCGGTTCGT CCAGACTGGT GGTCGATTTA TTTGAGCGTC TCGGTCACCT CTATGAACAG GAGGTCGTCA CTCAATATAA CCCTGATCCC CCTAACTATA CGGGGATTCC GATCCACGAA TTTTCCGGTG GAGACGCTTG GGATAAATTT ATAACCTACT TGCGATTGTA CCGGCCTGAT CTTGTTCACG TGCACTATTG GGAGGATCCA CACTGGTACG GAACGATGAT AAAGGCGGCC CGTGAATTCG GCTGCAAGAT AATACAAAAC GTCAACACTC CAACTGCTCC TTATATGGAC AGTTGTATCA GCCGCTACAT ATATGTCAGT GACTACGTAA AAACCCGGTT TGGAAAGAGG GATCAGCCGA GCATTACGAT CTACCCGGGC AGCAACTTCA AATTATTTTC GAGAGATAAA TCTCGACCAG TACCTGACGA TTGCATCGGC ATGGTTTATA GGCTGGATAT CGATAAACTC CATAGGAAAT CGATTGATGT TTTCATAAAG GTTGTCCAGA AAAGGCCGCA GACAAAAGTA ATTATTGTTG GCGGGGGGCC CTACCTGGAG CCTTATAAAG CGGCGGTCAA GGCGGGGAAA GTTGAACATG CTTTTACTTT TACGGGTTTT GTTCCTTATG AAAGCCTGAT CGGGTTCTAT GCCCGGATGA GCCTTTTTGT GGCTCCCGTC TGGAAAGAAA GCTTTGGGCA GGTCGGTCCC TTTGCTATGA GCATGGGCTT GCCAGTGGTG GGCTACAATG TCGGCGCCTT GGCGGAGATC GTGGGCGACT GCGGTCTTCT TGCCCCACGA GGCAACAGCG AAGCGCTAGC CGAAATCATT ATCGGCCTGC TGGATGATAA GGAGCGCCGT GAACAAATAG GCTCCCGTAA TCGGGATCGG GCTCACAAAT TATTTTCCGT CGAAAACATG GTAAATGACT ACCTCAAGCT GTACCAGGAG CTAATAGGCA ACACCCAATG A
|
Protein sequence | MKNLSPTYVF DIQEQLWSRP GYGGINYSDG DDIELQIAKV IAEAADITVL SRELRQHCTD WPFLYHLSAS RANILRPFAT TLTGDILEIG AGCGAITRYL GEAGANILAL EGSPRRAAIA RSRTRDLENV TVLAEKFDQF ECDHQFDLIT LIGVLEYANL FTTGENPALA MLQRVRSLLK PEGTLILAIE NQLGLKYFAG ALEDHIGQPM YGIEGRYTRN QPQTFGRMVL SNLMKEAGFT TIEFLAPFPD YKLPVSILTE EGLSSKEFDG AALAWQSVRR DHQLPLSTNF SIEMVWLEVF KNGLALDMAN SFLLAASPLQ QKIVKAGILG YHYSTDRIPQ YCKETIFQRI ENSTTVNYRM LGSSLLDNRT DQVIKFACPD RAIYAEGFPL SLEFTRLLTS DGWSMSGVGL LIHRYIGLLE FIANQKGHAW DKPLPHRLPG EFFDMVPQNI IIDKNGEPVS IDTEWVLKNG IEAGWLLFRS LLLTIDSIMC FGRNLAEQKF SRREFIISAL DAAGFALTNE DFQRFIELEA DVQQEVTGRP SHEFVNWRPE HPLPTYSPVK HGGEIGALQS EVGVAWREID MMRDEIDVAR AKIEMLHEEL DALYRSTSWR ITAPLRGMRR LITRFADRPS PLSLARATLK HGTECYQTKL LADPQVSRQR IVHVIGNFLT GGSSRLVVDL FERLGHLYEQ EVVTQYNPDP PNYTGIPIHE FSGGDAWDKF ITYLRLYRPD LVHVHYWEDP HWYGTMIKAA REFGCKIIQN VNTPTAPYMD SCISRYIYVS DYVKTRFGKR DQPSITIYPG SNFKLFSRDK SRPVPDDCIG MVYRLDIDKL HRKSIDVFIK VVQKRPQTKV IIVGGGPYLE PYKAAVKAGK VEHAFTFTGF VPYESLIGFY ARMSLFVAPV WKESFGQVGP FAMSMGLPVV GYNVGALAEI VGDCGLLAPR GNSEALAEII IGLLDDKERR EQIGSRNRDR AHKLFSVENM VNDYLKLYQE LIGNTQ
|
| |
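The sequences above are printed in space-separated blocks of ten residues, so they need light cleanup before any computation. The sketch below is one possible way to do that, assuming plain Python with no external libraries. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical, and the example uses only the first three blocks and the final block of the gene sequence, so its GC value is not expected to match the 48% reported for the full gene.

```python
# Stop codons defined by translation table 11 (the table listed in this record).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    dna = clean_sequence(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """Check whether the last three bases form a stop codon."""
    return clean_sequence(dna)[-3:] in STOP_CODONS


# Fragments copied from the start and end of the gene sequence.
head = "ATGAAAAACC TCTCGCCAAC TTATGTATTC"
tail = "ACACCCAATG A"

print(len(clean_sequence(head)))    # number of bases in the fragment
print(round(gc_fraction(head), 3))  # GC fraction of the fragment only
print(ends_with_stop(tail))         # whether the gene ends in a stop codon
```

Passing the complete gene sequence to `gc_fraction` instead of the fragment would give a value comparable to the "GC content" field.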