Gene Mext_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0335 
Symbol 
ID5831704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp379327 
End bp382113 
Gene Length2787 bp 
Protein Length928 aa 
Translation table11 
GC content67% 
IMG OID641366120 
ProductPII uridylyl-transferase 
Protein accessionYP_001637830 
Protein GI163849787 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.636952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGACC CCGTTCCCGT CCTGGAAAAG ATCGTCGCCT CCCTCGACCG CGACGCCCGC 
GAGCCGACCA AGCTGCGGGG GCTCCTTGTG CCCGAGTTGC GCAAGGTGAT CGAGACCGGT
CACGCGGAGG CCGAACGGCT TCTCCTCAAG GAACGCGACG GCCTCGCCTG CGCGCAGCGC
CTGAGCCGCC TCACCGATGC GGTGGTGCGG GCGATCTACG ACGCTGTGGT CTGGCGCCTC
TATCCCAACG ACAACCCGTC CACCGGCGAG CATCTCGCGA TCGTCGCCAC CGGTGGCTAC
GGCCGGGGTA CGATGGCGCC GGGCTCGGAC ATCGATCTGC TGTTCCTGCT GCCCTACAAG
CAGACCGCGT GGTCCGAGAG CGTCGTCGAG GCGATGCTCT ACGTGCTGTG GGACCTCAAG
CTGAAGGTCG GCCACGCGAC GCGCTCCGTC GAGGAATGCC TGCGCGAGGG CCGCGCCGAC
ATGACGATCC GCACCGCTTT GCTGGAATCC CGCTTCCTGT TCGGCTCGCG CTCGCTGTTC
GAGGAGATGG TCACCCGCTT CGACACCGAA CTGGTGATCG GGTCGGCCTC GGAGTTCGTG
GACGCCAAGC TGCGCGAGCG CGATGCCCGC GTCGCCAAGG CAGGCGCCTC GCGCTACCTC
GTCGAACCCA ACGTCAAGGA CGGCAAGGGT GGCTTGCGCG ATCTCAACAC CCTGTTCTGG
ATCGCGAAAT ACACCTACCG CGTGCGCAAT CAGGCCGAGC TGGTCCATGC CGGGCTGTTC
ACGCCGGACG AGTACCGCCT GTTCGAGCGC TGCGAGGAAT TCCTGTGGCG GGTGCGCTGC
CACATCCACT TCGTGACGGG GCGGCCGGAG GAGCGGCTGT CGTTCGGGCT GCAGCCCAAG
ATCGCCGAGC GGCTCGGCTT CGGCTCCCGT GGCGGCCTCT CGGGCGTCGA GCGCTTCATG
AAGGCGTATT TCCTCATTGC CAAAGATGTC GGCGACCTCA CCGCCATCGT CTGCGCCGAG
TTGGAGGCAC GCCACGCCAA GCGCACGCCG GTGCTCGACC GCTGGATCGG CCGCTTCCGC
GACCGCTTCC GCGCGACCTC GATCGAGGCC GAAGATTTCT GGATCGACAA CGGCCGGGTC
AACATCCGCG GCGAGGACGC GTTCGAGCGT GATCCGGTCA ACCTGATCCG TCTGTTCTGG
CTCGCCGACC GGCACAACCT CCCGATCCAC CCGGACGCGG CGCGGCTGGC CAACCGCTCG
CTCAAGCTCA TCACCCACGC GGTGCGCATC GATCCGGAGG CCAACCGGCT CTTCCTCGAC
ATCCTCACCT CGAAGAACGC GCCCGAAACG ATCCTGCGAT CGATGAACGA GGCGGGCGTG
CTCGGCCGGT TCATCCCGGA ATTCGGCCGC ATCGTCGCGA TGATGCAGTT CAACATGTAC
CACCACTATA CGGTGGACGA GCATCTGCTG CGCTCGCTCG GCGTGCTCGC GGCGATCGAT
TCGGGCCGGG TGCGCGACGA GCATCCGCTC GCCACGCGGC TCATCGATAC GATCCACAAC
CGCCGCGCCC TCTATGTCGC GATCCTGCTC CACGACATCG CCAAGGGCCG GCCGGAGGAC
CACTCGATCG CGGGTGCGGC CATCGCCCGC AAGCTCGGGC CGCGCTTCGG ATTGAGCCAA
GCGGAGACCG AGACGGTCTC GTGGCTGGTC GAGCACCACC TGCTGATGTC GATGACGGCG
CAGAGCCGCG ATCTCTCCGA CCGCAGGACG ATCGAGAAGT TCGCGAGTGA GGTGCAGAGC
CTGGAGCGGC TGAAGCTGCT CGCGATCCTC ACCGTCGCCG ACATCAAGGC GGTCGGCCCC
GGCGTGTGGA CGGCCTGGAA GGGCACGCTG CTGCGCACCC TCTACGACGA GACCGAAGTC
GTCCTGTCCG GCGGCCATTC GGAGATCGCC CGCACCGACC GGGTGCGGCT GATCCAGATG
GCCCTGCGCG AGCAGCTCTC CGACTGGAAC TCGGAGCTGT TCGATGGCTA CGCCGCGCGC
CACAACCAAG CCTACTGGCT CAAGGTCGAC TCGACGCGCC ACTTCAAGAA CGCCCGCTTC
CTGCGCACGG TGATGGAAGA GGGCCGCACC AGCGCCACGA CCTACGAGAC CGATCCGGTG
CGCGGCGTGA CCGAGTTGAC GGTCTATTCC CCCGATCATC CGCGGCTGCT CGCCATCATC
ACCGGCGCCT GCGCGACCAT GGGCGGCAAC ATCGTCGATG CGCAGATCTT CACGACCACC
GACGGCTTCG CCCTCGATTC GATCTTCATC TCCCGCGCCT TCGAGCGGGA CGAGGACGAG
CTGCGCCGGG CCGGCCGCAT CGCCACGGCG ATCGAGCGGG CGCTGAAGGG CGAGATCAAG
ATCGCCGAAC TCGTCGCCGA CAAGCACCCG AAACAGCCAC CCAAGACCTT CCTCGTGCCG
CCGGACGTGT CGATCGACAA CGCCCTGTCG AGCCGCGAGA CCGTGGTCGA GATCACCGGC
CTCGACCGGC CGGGCCTGCT CTACGAGCTG ACGACGGGAC TCAACCGCCT GAGCCTCAAC
ATCACCTCGG CGCATGTGGC GACCTTCGGC GAGCGGGCAG TGGACGTGTT CTACGTGACC
GACCTCACCG GCACCCGCGT GGTGCAGCCC GACCGCCTCG CGATGATCCG CGCCGCGGTG
ATGGAGGTGT TCGCCAGCGA CGTCGCTGCG CTCCGCGCGG AAGGGCTCGA CGCCCTCGTC
GATTCGCCGC CGCCGCGGGA ACTCTGA
 
Protein sequence
MFDPVPVLEK IVASLDRDAR EPTKLRGLLV PELRKVIETG HAEAERLLLK ERDGLACAQR 
LSRLTDAVVR AIYDAVVWRL YPNDNPSTGE HLAIVATGGY GRGTMAPGSD IDLLFLLPYK
QTAWSESVVE AMLYVLWDLK LKVGHATRSV EECLREGRAD MTIRTALLES RFLFGSRSLF
EEMVTRFDTE LVIGSASEFV DAKLRERDAR VAKAGASRYL VEPNVKDGKG GLRDLNTLFW
IAKYTYRVRN QAELVHAGLF TPDEYRLFER CEEFLWRVRC HIHFVTGRPE ERLSFGLQPK
IAERLGFGSR GGLSGVERFM KAYFLIAKDV GDLTAIVCAE LEARHAKRTP VLDRWIGRFR
DRFRATSIEA EDFWIDNGRV NIRGEDAFER DPVNLIRLFW LADRHNLPIH PDAARLANRS
LKLITHAVRI DPEANRLFLD ILTSKNAPET ILRSMNEAGV LGRFIPEFGR IVAMMQFNMY
HHYTVDEHLL RSLGVLAAID SGRVRDEHPL ATRLIDTIHN RRALYVAILL HDIAKGRPED
HSIAGAAIAR KLGPRFGLSQ AETETVSWLV EHHLLMSMTA QSRDLSDRRT IEKFASEVQS
LERLKLLAIL TVADIKAVGP GVWTAWKGTL LRTLYDETEV VLSGGHSEIA RTDRVRLIQM
ALREQLSDWN SELFDGYAAR HNQAYWLKVD STRHFKNARF LRTVMEEGRT SATTYETDPV
RGVTELTVYS PDHPRLLAII TGACATMGGN IVDAQIFTTT DGFALDSIFI SRAFERDEDE
LRRAGRIATA IERALKGEIK IAELVADKHP KQPPKTFLVP PDVSIDNALS SRETVVEITG
LDRPGLLYEL TTGLNRLSLN ITSAHVATFG ERAVDVFYVT DLTGTRVVQP DRLAMIRAAV
MEVFASDVAA LRAEGLDALV DSPPPREL