Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nham_1062 |
Symbol | |
ID | 4031629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrobacter hamburgensis X14 |
Kingdom | Bacteria |
Replicon accession | NC_007964 |
Strand | - |
Start bp | 1191438 |
End bp | 1193789 |
Gene Length | 2352 bp |
Protein Length | 783 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637969560 |
Product | glycosyl transferase family protein |
Protein accession | YP_576370 |
Protein GI | 92116641 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGCG TCATCCGAGC CCGCAAGGAA CCGATCGCCA AGCCGAAGCT GGTCAGCCGA CTCGCGCTCT GCGACGACGG CACGGTGTCC GGTTTCGTGT TCGATCCGCA GACTCCGGAA CGCCACTTTA CGGTCGACAT CCTGCTTGAC GGCCTGGTGC TGAAGACCGC CTACGCCGAT GCCTTCGTCC CCGAACTCTC CCGGCAAGAC CGGAACGGCG CTTGCGGTTT CGCGGTGACC ATCGAGCCCG ATCTGCTCCG TGCCGCGCGG CATCTTTCGG CGCGCCTCGC CAATCTCGGG ACTTCCGTCG GCCACCCCAT CGACCTTGAG AATGACAGCG CAGCCCTGGT CGATCTGCGC CCGACCTGCA AACTGCGTTG GCTGGGTGGC CTGCACTTCC AGGGCTGGAT CGACAGCGAA GCCGCCGTCA CCCTGGAAGC GATCGTCGAT GGAGAATCCG TCGCGCAGGT CCGCGCCGTA GCCTGGGCGC ACATCGGCGG CGATACCGAG GGGGACGCCT CGCGCAACAT TCGCGCCTTC GATTTTCACG CCCCGCAACG CTTTGCCGAC GGGCGTGTCC ATCGAATCCT GCTGCGGAAA GAGAATGGCG AACAAATTCC GGCAACGGCT GTCTTCGTTG CATTTCCCGA TGGCCTCGCC GGAATGATCG ACGTCATCGG CGGTTATGGC GCCGAGCGGT TGCGCGGCAA ACTGTACGAT CAGCTTATTC CCGCTTCGCT GCCCCTGCAC GACTATGCCG ACTGGCGCGA TCGCTTCCCG CTGCCGGAGC CGCAGCCAAG CCCGCTTCGG CTGGCCGTCG TGATCGCCGG CAGCGCCGGA GCCCAACAGA CGCTCTCCAC CCTGGAGACG CAAAGCCACG AGAACTGGAC CGCGGGCGCG ATCGACGGTC AGCCGCTCCT GATCGACAGC GACGCCCTGC TGGAGTTTCT CGAGAACGCC GCATCCGATG CGCGGCACGT CGTCGTGACC ATGGCGGGCG TGTCGCTCGA ACGCAACGCC TTGGCACGCA TCGCCGCCGC ATTCGACGTC CATCCCGACG CGGTGGCGAT ATATGGCGAC CTCGACTTTC TCGCTGACGA CGGCCGCCTC TGGCCGCTTG CCTTTCCGGC GTTCGATTAC GAAAGGATGC TCGAGCAAGG GTATTGCGCG CATCTGTTCG CGGTCCGGCG CGACGCCCTG ATCGCGGCCA TCGAGGCGCG TCCGGACAAC CTCTACCGCC TCTTCAACTG CCTGCTGGAT CAGGCCGGTC CGCTGCAAGC AGATATCCTC CACTTGCCCG GAGCGTTGGC GACGATGCCG AAGCTCGACC GGGCAAAAAC CGGCTCCCTG CTGTCCGCCG CGAGCTATCT GCACCTGCGG GCACGCGGAA TCGATGCGGA CATAACCGAG CAGCAAGGCA ATTTGTTTCC CGCTGTTCGA ATCAAGCGGC CGTTTTCGCA GCAGCGGGTG ACCGTGATCA TCCCGACGCG CGACCGCGTT TCTCTCCTGC GCCGATGTCT CGACAGCATC GCACCCGCGG TCGAACGCTG TGGCGCCGAT ATCCTTGTCG TCGACAACGA CAGCGCCCAT CCGGAGACGA TCGGCTTTCT TGCAGATCTG CCGCGACGCG GCATCCGGAC ATTGCGGATC GAGGGGCCGT TCAACTTCGC AAGGCTCAAC AATCAGGCCA TTGCTACGCT CGACAGCGAC ATCCTGTGCC TTCTGAACAA CGACATCGAG GCAAGCTCCG ATGACTGGCT CGAGGAGATG CTTACGCGCT TGGGCGAACC GGAGGTCGGT GCGGTCGGCG CGCTGTTGAC CTGGCCTGGC GGCATCATTC AGCACGGCGG GGTGGTGTTG GGGATGAATT TTTCGGTCGC CCACGCCTTC ACAGACCGAT TCAGCGGTGA CCCGGGCTTT CTCGATCAAC TCCTCGTGGC GCATGAATGC AGCGCGGTGA CCGCGGCCTG CCTTGCGACC CGACGAAGCG ACTATCTTGC GGTCGGCGGA ATGGACGAAG CCCGCTTCGC CGTGACATTC AACGATGTCG ACTATTGCCT CCGCCTGCGC GAGGCCGGCA AGCGCATCGT CCTGACGCCG CACGCGAAGC TGATCCACGC CGAATCCGCC AGCCGTGGCA GCGACAATCG CGCCAACCGG CGGGACCGGT TTGAACATGA GCTCAATTTG CTCCGTGCGC GCTGGGGCGA GGTGCTCAAC GACGACCCGG CCTACAATCC GCAACTGTCG CGCGATGGTG TTCCGTATGG CGGGCTGGCC TGGCCTCCGG GACCACGGGT CCCTCGGTAC AATCGGCCGC CGCGCGCCGG CGATCTGCCG TTGGGGTTTT GA
|
Protein sequence | MARVIRARKE PIAKPKLVSR LALCDDGTVS GFVFDPQTPE RHFTVDILLD GLVLKTAYAD AFVPELSRQD RNGACGFAVT IEPDLLRAAR HLSARLANLG TSVGHPIDLE NDSAALVDLR PTCKLRWLGG LHFQGWIDSE AAVTLEAIVD GESVAQVRAV AWAHIGGDTE GDASRNIRAF DFHAPQRFAD GRVHRILLRK ENGEQIPATA VFVAFPDGLA GMIDVIGGYG AERLRGKLYD QLIPASLPLH DYADWRDRFP LPEPQPSPLR LAVVIAGSAG AQQTLSTLET QSHENWTAGA IDGQPLLIDS DALLEFLENA ASDARHVVVT MAGVSLERNA LARIAAAFDV HPDAVAIYGD LDFLADDGRL WPLAFPAFDY ERMLEQGYCA HLFAVRRDAL IAAIEARPDN LYRLFNCLLD QAGPLQADIL HLPGALATMP KLDRAKTGSL LSAASYLHLR ARGIDADITE QQGNLFPAVR IKRPFSQQRV TVIIPTRDRV SLLRRCLDSI APAVERCGAD ILVVDNDSAH PETIGFLADL PRRGIRTLRI EGPFNFARLN NQAIATLDSD ILCLLNNDIE ASSDDWLEEM LTRLGEPEVG AVGALLTWPG GIIQHGGVVL GMNFSVAHAF TDRFSGDPGF LDQLLVAHEC SAVTAACLAT RRSDYLAVGG MDEARFAVTF NDVDYCLRLR EAGKRIVLTP HAKLIHAESA SRGSDNRANR RDRFEHELNL LRARWGEVLN DDPAYNPQLS RDGVPYGGLA WPPGPRVPRY NRPPRAGDLP LGF
|
| |