Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4588 |
Symbol | |
ID | 5833959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 5126000 |
End bp | 5128078 |
Gene Length | 2079 bp |
Protein Length | 692 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641370382 |
Product | hypothetical protein |
Protein accession | YP_001642027 |
Protein GI | 163853984 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.345219 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCCTCG GCCCGCTGCG GAGCGCGGCC GCGTTCCTCG CTGCCGTCCT CGTCGGATCG CCGCTCTTCG GCTTCACCGC GGCGGTTCGG GGCGGCGAGG CGAGAGCGCC GCTGGTCCTG CGGGTTGCGC CCACCGGAGA CAGAGCCCCC CGTCGCGACA ACCGCTTCGC GGATCTGCCG CGGGCGCTTG CATACGTCGC CGCGCTGCGC CGTCAGGGGG AGGGGCGGGC GATCGTCGTC GAGCTGGAAC CCGGAACGCA CCGGATCTCG GCGCCCGTCC GGATCGGCCC CGACCATGCC GGCACGGCCG GGGCGCCCCT GATCCTGCGC GGGGCCGACG ACGGATCGAG CCGGCTCGTC GGCAGCGTGC CCCTGGCGCC CGCATCGCTG CAGCCGCGCC TGCGCGCCCG GTTGCCCGTC TCTGCCCGCG GTGCGGTGCG CGCCTACCAA TTGCCCGAGG CTTTGCGCGG GGAGCTCGCC TACCGCGTGC CGCGGCGCCT ACGCGAGACG CACCCGCGCG TGACCGAGAT CTTCGATGCG GGCGGTGCGC TGCGCCCGGC CCAGTGGCCA AATCCTGGGC CGAACCCTGG GCCAAATCCT GGGCCGAACC CTGAACCGAA CTCCGGCTGG ACGACGGTCG CCGCGGCCGA AGCGGGCGGC ATGGCCTTCA CCCTCAAGGA CGCGGCGGGC CTGCCCGATC TGTCCCTGGA GCGCGACCTG TGGGTGGAGG GCTTCTGGCG CTGGGACTGG CTGCTGGAAA CGCTGCACGT GGCGCAGGTC GATCAGCGCC GCCGCCGGCT CGAACTGGAC CATCCGCCCT ACGAGGGCAT CCGCGACGGC GCCCGGATGC GGCTGGTCCA TGCGCTCGGT GCCCTCGATG AACCCGGTGA ATGGTGGCAC GACGGCGAGA GCGGCCTGCT GCTGGCCTGG CCGTCTCCCG GCGCGGCCGA CCTCGAAGTC AGCCTCGCCG AGACGCTGAT CCGGGCCGAT GGCGCGCGGC ATCTACGTAT CGAGCGGCTT CGGCTGGAGC GCGCGCGCGG CGACCTGATC GTCGTGCGGG GGGGCGAGGA TATCGAGATC CGCGCGAGCG AACTGGCCTG GGCGGCAGGC CGGGCGGCGG TGTTCGAGGG GGTGACCGGG GGTGGCGTCT CCGGCAGCGC GATCCATGAT ATCGGCGCGA GCGCGGTCCG CCTCGTCGGC GGCGACCGCG CCACGCTCCG GCGGGGCGGG CTGTTCGTGC GCGACACCCG CTTCACCCGC TTCTCGCGGC TGAGCCAGAC CCAGAGTTCC GCGATCGAAC TCGACGGCGT CGGCGCGGAG GCGATCGGAA ACCTCATCAC CGACGCGATC GGCTACGCCA TCTACTTGCG CGGCAACGAT CACGTGTTTC GCGGCAACGA GGTCGCCCGG CTCATCCACG GCCTGAGCGA TACCGGCGCG ATCTATGCCG GACGCGACTT CACCGCCCGC GGCTCGATCA TCGAGGACAA TTACGTCCAC GACATCCGCA CCGTGCCGGG CATGGAGGTG AAGGGCGTCT ATCTCGACGA CATGGCGAGC GGCTTCACCA TCCGCCGCAA CCTATTCGTC GATGTGCAGC AACCGGTCTT CATCGGCGGC GGCAACGACA ACACGATCAC CCGCAACGTC TTCGTCGCGT CGAGCCCGAT GGTCGCCCTC GATGCGCGGG GTCTGACGTG GATGAAGCCG TCGCTGAACG AAGCGGATTC GGAGTTCCGG GCCGCCTTCG CTGCGATGCC GCTCGACTCC GCGCCCTGGC GGATGCGCTA CCCCAAGCTT GCGGAGGCGC TGACCGATGA GCCGGGCGTG GCGCGCAACA ACCAGATCGT CGACAACGTG AGCATCGGCA GCGACGAGCT CGCATTCACC GACAAGGCGG AGGCGGGCCG GCAGATCATT CTGTTCAACA CCCGCCTCGA TGGTCCGGTC CCGAATCCGA GCGACCTCGA TGCGCTGGCC CGCTTCACCG CCGAGCGCGG CATCACGCTT CGCCTCGACC CGTCGAAGAT GCGGCGGGAC GGGCTACCCG CCTCGCCGTT CGCGGAGGCG CGGCGGTGA
|
Protein sequence | MSLGPLRSAA AFLAAVLVGS PLFGFTAAVR GGEARAPLVL RVAPTGDRAP RRDNRFADLP RALAYVAALR RQGEGRAIVV ELEPGTHRIS APVRIGPDHA GTAGAPLILR GADDGSSRLV GSVPLAPASL QPRLRARLPV SARGAVRAYQ LPEALRGELA YRVPRRLRET HPRVTEIFDA GGALRPAQWP NPGPNPGPNP GPNPEPNSGW TTVAAAEAGG MAFTLKDAAG LPDLSLERDL WVEGFWRWDW LLETLHVAQV DQRRRRLELD HPPYEGIRDG ARMRLVHALG ALDEPGEWWH DGESGLLLAW PSPGAADLEV SLAETLIRAD GARHLRIERL RLERARGDLI VVRGGEDIEI RASELAWAAG RAAVFEGVTG GGVSGSAIHD IGASAVRLVG GDRATLRRGG LFVRDTRFTR FSRLSQTQSS AIELDGVGAE AIGNLITDAI GYAIYLRGND HVFRGNEVAR LIHGLSDTGA IYAGRDFTAR GSIIEDNYVH DIRTVPGMEV KGVYLDDMAS GFTIRRNLFV DVQQPVFIGG GNDNTITRNV FVASSPMVAL DARGLTWMKP SLNEADSEFR AAFAAMPLDS APWRMRYPKL AEALTDEPGV ARNNQIVDNV SIGSDELAFT DKAEAGRQII LFNTRLDGPV PNPSDLDALA RFTAERGITL RLDPSKMRRD GLPASPFAEA RR
|
| |