Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_2185 |
Symbol | |
ID | 6134737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 2432845 |
End bp | 2435655 |
Gene Length | 2811 bp |
Protein Length | 936 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641642412 |
Product | PII uridylyl-transferase |
Protein accession | YP_001769080 |
Protein GI | 170740425 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG2844] UTP:GlnB (protein PII) uridylyltransferase |
TIGRFAM ID | [TIGR01693] [Protein-PII] uridylyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00224079 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.664857 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGACA CCGCCGCCGC CCTCGCCCAA CTGCTGTCCG GCATCGACCA GACGACCAGC GACCCGACCA AGCTTCGCGA GCGGCTCGTG CCCGGGCTGC GGCGGATCAT CGAGGAGGGG CGCGCCGAGG CCGAGCAGAA CCTGCTGCGC CACCGCGACG GGCTCGCCTG CGCCCGCGAG ATCTGCGCCC TGATGGACGC GGTCGTGCAG GCGATCTACC AGGCGGTGGT CAAGCGGCTC TACCGGGCCG ACAACCCGAC CGCGGGCGAG CACGTCGCCG TGGTGGCGAC GGGCGGCTAC GGCCGCGGCA CGCTGGCGCC GGGCTCCGAC ATCGACCTCC TGTTCCTGCT CCCCTACAAG CAGACCGCCT GGGGCGAGAG CGTCGTCGAG GCGATGCTCT ACGTCCTGTG GGACCTCAAG CTGAAGGTCG GCCACGCGAC CCGCTCGGTC GCCGAGTGCC TGCGCGAGGG GCGGGCCGAC ATGACGATCC GCACCGCCCT GCTGGAGGCG CGGTTCCTGT TCGGGGACCG CGCCCTGTTC GACGAACTCG TCGAGCGCTT CGACCGCGAG GTCGTGCAGG GCACGGCGGC GGAATTCGTC GAGGCCAAGC TCAAGGAGCG CGACACGCGG GTCTCGAAGG GCGGGGCCTC CCGCTACCTC GTCGAGCCGA ACGTCAAGGA CGGCAAGGGG GGCCTGCGCG ACCTCAACAC CCTGTTCTGG ATCGCCAAGT ACACGTTCCG GGTGACGCGG ACCCGCGACC TCGTGGAGGC CGGGCTGTTC ACCCTCGACG AAGTCAGGCT GTTCGACCGC TGCGAGGAAT TCCTGTGGCG GGTGCGCTGC CACATGCATT TCGCGACCGG GCGGGCGGAG GAGCGGCTCT CCTTCGGGCT GCAGCCGCGC ATCGCCGAGC GGCTCGGCTA CGCGCCCCGC GGCGGGCTGA CCGCCGTCGA GCGCTTCATG AAATCCTACT TCCTGATCGC CAAAGACGTG GGCGACCTCA CCGCCATCGT CTGCGCCGAG ATGGAGGCGC GCCACGCCAA GCGCCCGCCG GCGCTGAACC GCTGGTTCGG GCGCTTCAAG GAGCGCTTCC GCGCCCCCGA CCTCGACGCG GACGCTTTCC GCATCGACAA CGGCCGGCTG AACCTGCGCG ACGACGCCGC CTTCGAGCGC GACCCGGTCA ACCTGATCCG CCTGTTCTGG CTCTCCGACC ACCACGACGT GCCCATCCAC CCGGATGCGA GCCGGCTCGC GACCCGCTCG CTCAGCCTGA TCGGCCCGAT GGTGCGGGTC GACCGCGAGG CGAACCGCCT CTTCCTCGAG CTCCTGACCT CGGAGAACGC CCCCGAGACC GTGCTGCGGC ACATGAACGA GACCGGGGTG CTCGGGCGGT TCGTGCCGGA TTTCGGCCGC ATCGTCGCGA TGATGCAGTT CAACATGTAC CACCACTTCA CGGTGGATGA GCACCTGATC CGCTCGCTCG GCGTGCTGGC CAAGATCATG AGCGGCGAGG CGAAGGACGA GCATCCGGTG GCGCACCGGA TCGTCGGCAC GATCCAGAAC CGCCGCGCGC TCTTCGTCGC CACCTTCCTG CACGACATTG CCAAGGGCCG GAAGGAGGAC CACTCGATCG CGGGCGCCGC GGTGGCGCGC AAGCTCGGGC CGCGCTTCGG CCTCGAACCG GCCGAGACCG ACACCGTGGC CTGGCTGATC GAGCACCACC TGCTGATGTC GATCACGGCC CAGAGCCGCG ACCTCTCCGA CCCGAAGACG ATCGAGACCT TCGCGGCCGC GGTGCAGAGC CTGGAGCGGC TGAAGCTGCT GTTCGTGCTG ACCATCGCCG ACATCAAGGC GGTCGGCCCG GGCGTCTGGA CCGCCTGGAA GGCGACGCTG CTGCGCACCC TGTTCTACGA GACCGAGGTC GTGCTCTCCG GCGGGCACTC GGAGATCGCC CGCACGGACC GGGTGCGCCT CCTGCAGATG CGCCTGCGCG AGCAGCTCCC CGACTGGAGC GCCGAGGAAT TCGACGCCTA CGCGGCGCGG CTCTACGCCC CCTACTGGCT CAAGGTCGAC GCCGCGCGCC AGCTCAAGAA CGCGCATTTC CTGCGCGCCA CGGTGGCGGC GGGCCGCACG GTGGCGACCC ACGTCGAGAC CGACGCCTCC CGCGGCGTGA CCGAGCTCAC GGTCTACTCC CCGGACCACC CGCGGCTCCT CGCCATCCTG ACCGGCGCCT GCGCGGCGGC GGGCGGCAAC ATCGTCGACG CGCAGATCTT CACCACGGCC GACGGCTTCG TCCTCGACAC GATCGTGCTG TCGCGGGCCT TCGATCAGGA CGAGGACGAG ATGCGCCGCG CCGGCCGCAT CGCCACCGCG ATCGAGCGGG CGCTCAAGGG CGAGATCCGC ATCGCCGACC TCGTCGCCGA CCGCCACCCG CGCAAGGACC GGCCGCGCAC CTTCCAGGTC GCGCCCGACC TCTCGATCGA CAACGCGCTC TCCTCCCGCG AGACGGTGCT GGAGATCTCC GGCCTCGACC GGCCGGGCCT GCTCTACGAC CTCACCACGG CGCTGAGCCG GCTCAACCTC AACATCACCT CGGCCCACGT GGCGACCTTC GGCGAGCGGG CGGTGGACGT CTTCTACGTC ACCGACCTCA CCGGCACCAA GATCACCCAG CCCGACCGGC AGGCGACGAT CCGCCGGGCC GTGATGGGCG TGTTCGAGGG CGACGCCGCG GCCGCCCGCC CGCCCGGCCG GCGCGCCGCC GCGCCCCGCC CGAAGGCCGC CGTGCCCGGC GAACCCGCTG GCGAAGCTTG A
|
Protein sequence | MPDTAAALAQ LLSGIDQTTS DPTKLRERLV PGLRRIIEEG RAEAEQNLLR HRDGLACARE ICALMDAVVQ AIYQAVVKRL YRADNPTAGE HVAVVATGGY GRGTLAPGSD IDLLFLLPYK QTAWGESVVE AMLYVLWDLK LKVGHATRSV AECLREGRAD MTIRTALLEA RFLFGDRALF DELVERFDRE VVQGTAAEFV EAKLKERDTR VSKGGASRYL VEPNVKDGKG GLRDLNTLFW IAKYTFRVTR TRDLVEAGLF TLDEVRLFDR CEEFLWRVRC HMHFATGRAE ERLSFGLQPR IAERLGYAPR GGLTAVERFM KSYFLIAKDV GDLTAIVCAE MEARHAKRPP ALNRWFGRFK ERFRAPDLDA DAFRIDNGRL NLRDDAAFER DPVNLIRLFW LSDHHDVPIH PDASRLATRS LSLIGPMVRV DREANRLFLE LLTSENAPET VLRHMNETGV LGRFVPDFGR IVAMMQFNMY HHFTVDEHLI RSLGVLAKIM SGEAKDEHPV AHRIVGTIQN RRALFVATFL HDIAKGRKED HSIAGAAVAR KLGPRFGLEP AETDTVAWLI EHHLLMSITA QSRDLSDPKT IETFAAAVQS LERLKLLFVL TIADIKAVGP GVWTAWKATL LRTLFYETEV VLSGGHSEIA RTDRVRLLQM RLREQLPDWS AEEFDAYAAR LYAPYWLKVD AARQLKNAHF LRATVAAGRT VATHVETDAS RGVTELTVYS PDHPRLLAIL TGACAAAGGN IVDAQIFTTA DGFVLDTIVL SRAFDQDEDE MRRAGRIATA IERALKGEIR IADLVADRHP RKDRPRTFQV APDLSIDNAL SSRETVLEIS GLDRPGLLYD LTTALSRLNL NITSAHVATF GERAVDVFYV TDLTGTKITQ PDRQATIRRA VMGVFEGDAA AARPPGRRAA APRPKAAVPG EPAGEA
|
| |