Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1000 |
Symbol | |
ID | 5835759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1077493 |
End bp | 1078971 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641366782 |
Product | hypothetical protein |
Protein accession | YP_001638476 |
Protein GI | 163850433 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACT ATTATCCGTT GCTGGCACGC GCGCTCGACG CCCTGCCCGA CCGTTCCCCG GCCCTGCGCC GGGCCGTCTA CGACCGTGCA CGCAGCGCGC TGATCGCGCA GCTCCGCTCG CTCGACCCGC CGGTGCCGGA AGCGGACATC GACCTCGAGC GCAAAGCGCT CGACACGGCG ATCGGCCGCC TGGAGGCGGA ATACGAGGCG CCGCCCGCGG CCGTGACGAC GCCGGCCGAG GAGCCTGCGG CCGCCGCTCC AGAGGCACCC CCTCCCCCGC CGCCCGAACC GACCCGGCCT GAGCCGCTCT CGCCCGGCCC CCTGCCGCCG ACGCTGCCGG AGCCGGAGCC GCCGCAGACG TCCGACGAGC CGCTGGTTCT GCCGCCGGCC TCGGTGCCGG CTGGAATCGG ATCGGACACC GGACCGGCCG AGCCGAAGCC GCCGACGGAG ACGGTGCCGT TCATGCCGCC GACCCGCCGG CCGAAAGCCG ACGAGGCCGT GAAGCCTGAG CCGGAAAACG AGAACGGCTT CATCCCACCG GTTGCCGAGC CTGAGCCGGT CTCCGTCGCG TCCGAGGCCG AGGCCGGCGC CGATCCGGCC TCACCCGAGA CGAACGGAGC CGGCGAGGCG GGCAACGGCC GCCAGCGCCC GCGCATCGAC GTGGTGACGC CGCCCGAGGG GCGTTCGCGC CTGCTGCGCA ACCTGTTTGT CGGCGGCGTG CTCGCGGCGG TGATCGCGCT GATCGCGGTG GCGGCTTTCT TCCTGCGTGA CCAGCCCTCC GATCTCCAGC AGAGCGCGGC CGAGCAGGAG ACGCCGGCCG AGCAGCCGGA CGCGAAGTTC TCGGATCGGG TCGGAGCCGA GCGCAACGAG GCCGAGGCCC GGCCGAAGCC GGCCGCTCCC GGCGCCGCCC CGGCCCAGCC GGAGGTGACC GTCTCGCAGC GGGCGATCCT CTATGAGGAG AACCAGAGCG ACACGCGCGC CCAGCCGATC GCGACCAACG GCCATACGGT CTGGCGGCTG GAAGCGGTGA ACGGCGAACA GGGCGAGCCG TTGCAGACGG CACTCCGCGT CAACGTGGAG TTCCCGGAGG CGGGGCTGAC GCTGGCGATG ACCATGCGCA AGAATCTGGA TGCGACGCTG CCCGCGTCTC ACACCGTCGA ACTCGCTTTC ACCAACAACG CGGATGCCGG CGCGCAGCGC GCGGTGCAGA ATATCGGCCT GCTTCAGCTC AAGGACGAGG AAGCCTCCCG CGGCTCCCCG GTCTCGGGCC TGCCGGTGCG GGTGCGCGAG AACCTGTTCC TGATCGGTCT GTCGTCGCTG AAGAGCGACG TGGACCGCAA CACCGAGCTG CTGCTGCACA AGAACTGGTT CGATCTGGCC CTGACCTACG CGAACGGCCA GCGGGCGGTC ATCAGCTTCG AAAAGGGCAG CGCCGGCGCC CAGGCTCTGC AGAGCGCCTT CGCGCAGTGG CGCGACTAA
|
Protein sequence | MADYYPLLAR ALDALPDRSP ALRRAVYDRA RSALIAQLRS LDPPVPEADI DLERKALDTA IGRLEAEYEA PPAAVTTPAE EPAAAAPEAP PPPPPEPTRP EPLSPGPLPP TLPEPEPPQT SDEPLVLPPA SVPAGIGSDT GPAEPKPPTE TVPFMPPTRR PKADEAVKPE PENENGFIPP VAEPEPVSVA SEAEAGADPA SPETNGAGEA GNGRQRPRID VVTPPEGRSR LLRNLFVGGV LAAVIALIAV AAFFLRDQPS DLQQSAAEQE TPAEQPDAKF SDRVGAERNE AEARPKPAAP GAAPAQPEVT VSQRAILYEE NQSDTRAQPI ATNGHTVWRL EAVNGEQGEP LQTALRVNVE FPEAGLTLAM TMRKNLDATL PASHTVELAF TNNADAGAQR AVQNIGLLQL KDEEASRGSP VSGLPVRVRE NLFLIGLSSL KSDVDRNTEL LLHKNWFDLA LTYANGQRAV ISFEKGSAGA QALQSAFAQW RD
|
| |