Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_3721 |
Symbol | |
ID | 5831027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 4118414 |
End bp | 4120249 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641369511 |
Product | allophanate hydrolase |
Protein accession | YP_001641166 |
Protein GI | 163853123 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases |
TIGRFAM ID | [TIGR02713] allophanate hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.640157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.348876 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGTTC CCGCCTTCCC GAGCCTCAGC GCGCTCCACG CCGCCTATGC CGACGGGCTC TCGCCCGAGG CGGTTCTGGC CGAGATCGAC CGCCGGATCG CAGCCGCCGA CGATCCCGGC ATCTTCCTCG CGCGGGTGCC GGCGGCGGAG ATGCAGGCCG CCGCCCGCGC CCTGGGCCCC TTCGACCCCG CGAACAAACC GCTCTGGGGC GTGCCCTTCG CGGTGAAGGA CAATGTCGAC GTCGCAGGCC TGCCCACCAC CGCCGCCTGC CCCGACTTCG CCTATACGCC GCAGGCGACC GCCCCGGCGG TCGAGCGCCT GCTCGCAGCC GGCGCGATCC TCGTCGGCAA GACCAATCTC GATCAGTTCG CCACGGGGCT CGTCGGCGTG CGCACGCCCT ATCCGGTGCC GAAGAACGCC ATCGATCCCG CCATCGTGCC GGGCGGCTCC TCGTCCGGCT CGGCGGTGGC GGTGGCGCGC GGCCTCGTCA CCTTCGCGCT CGGCACCGAC ACGGCGGGTT CGGGCCGCGT GCCGGCCGGC CTCAACAACA TCGTCGGGCT CAAGCCCTCC CTCGGCAGCG TCTCGGGCCG CGGCGTGGTC CCGGCCTGCC GCACGCTGGA CACCCTCTCC GTCTTCGCCG GCACCGTGAC GGAGGCGGAT GCGGTCTTTC GGATCATGGC CGGGTACGAT CCGCAGGATC CGTATTCGCG CGCCCTGCCC GTTCCGCCCC GCCCCGGTGC CCTGCCGCCG GGCTTGCGCG TCGGCGTGCC GGATGCGGCG GGCCTGATCT TTGCCGGCGA CGCACTCTCG GCGTCGGCTT TCGACGGGGC ACTCGCCGAC CTGCACACGG TGACTGGCGC TGCAGCGACG GCGGTCGATC TCGCGCCCTT CTTCGCGGTG GCTGGGCTTC TCTATGCCGG CCCCTGGGTG GCCGAGCGCT ATCAGGCGAT CCGCGGTTTC ATGGAGGAGC GCCCGGAGGC CCTGCACCCG ACGACGCGGG CGATCATCAG CGCGGCCACC GGCCACTCGG CAGCGGATGC CTTCGCCGGC CTCTACCGTC TCGCCGAACT GCGCCGGGCA ACCGAACCGG TCTGGCGCGG AATCGACGTG CTCGTGGTGC CGACTTATCC GCGTCCGCGC CGCGTCGCCG ACCTCGCCGC CGACCCGGTC GGCCCCAACA GCGAGCTCGG CACCTATACC AACTTCGTCA ACCTGCTCGA TCTCTGCGCC CTCGCGGTGC CGGGGCGCTT CCGGGCCGAC GGCCTCCCGT CCGGCGTCAC GCTGATCGCG CCGCGCGGGG CCGACGGTCT CATCGCCGAA CTCGGCGCCC GCCTCCACGC GGCGGCCGGC GGCACCCTCG GGGCGAGCGG CGTGCCGATT CCGGCGGAGG CCGCATCGCC GGGGAAGCGT GCGCAAGGCA GGGATCGCGC GCAAGGGGAC GAGATCGAGA TCGCGGTGGT CGGCGCGCAC CTGTCCGGCC TGCCGCTCAA CGGCGAGCTG ACCGCCCGCG GCGCCCGCTT CCTGCGCGCC GTCCCCACGA CGCCGGATTA CCGCCTGCAC GCGCTGCCCG GCGGACCACC GGCCCGGCCC GGCCTGATCC GCGTCGCGCC CGGTTCGGGC CATGCGATCG AGACCGAAAT CTGGGCGCTG GCGCCGGACG CTTTCGGCTC CTTCGTGGCG GGGATTCCCG CGCCGCTCGC GATCGGCACG CTGTCGCTGG CCGACGGCAC GGCGCCGAAG GGGTTTTTGG CGGAAGCGGC CGGGCTGACG GGCGCGCGCG ACATCAGCGA TCACGGCGGC TGGCGCGCCT ATCTCGCCTC CATAACAGCG ACGTAG
|
Protein sequence | MPVPAFPSLS ALHAAYADGL SPEAVLAEID RRIAAADDPG IFLARVPAAE MQAAARALGP FDPANKPLWG VPFAVKDNVD VAGLPTTAAC PDFAYTPQAT APAVERLLAA GAILVGKTNL DQFATGLVGV RTPYPVPKNA IDPAIVPGGS SSGSAVAVAR GLVTFALGTD TAGSGRVPAG LNNIVGLKPS LGSVSGRGVV PACRTLDTLS VFAGTVTEAD AVFRIMAGYD PQDPYSRALP VPPRPGALPP GLRVGVPDAA GLIFAGDALS ASAFDGALAD LHTVTGAAAT AVDLAPFFAV AGLLYAGPWV AERYQAIRGF MEERPEALHP TTRAIISAAT GHSAADAFAG LYRLAELRRA TEPVWRGIDV LVVPTYPRPR RVADLAADPV GPNSELGTYT NFVNLLDLCA LAVPGRFRAD GLPSGVTLIA PRGADGLIAE LGARLHAAAG GTLGASGVPI PAEAASPGKR AQGRDRAQGD EIEIAVVGAH LSGLPLNGEL TARGARFLRA VPTTPDYRLH ALPGGPPARP GLIRVAPGSG HAIETEIWAL APDAFGSFVA GIPAPLAIGT LSLADGTAPK GFLAEAAGLT GARDISDHGG WRAYLASITA T
|
| |