Gene Information |
Locus tag | Mpe_A2247 |
Symbol | |
ID | 4785379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2406407 |
End bp | 2407576 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640090815 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_001021438 |
Protein GI | 124267434 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
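The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the record does not state either convention explicitly. The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is excluded from the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp / End bp fields of this record.
gene_len, protein_len = feature_lengths(2406407, 2407576)
print(gene_len, protein_len)  # 1170 389
```

Under these assumptions the result agrees with the Gene Length (1170 bp) and Protein Length (389 aa) fields.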
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTTTG ATCCGCTGAC CACGCTGCCG CTGGCCCTGC TCGGCCTGAT GGTGGCCTTC GCGCTCGGCT GGCTGGCTTC GCGCTTCGAC GTCCGCCAGT GGAAGCGCGA GCAACAGGAA TCGCCGAAGG CCTACTACAA GGGCTTGAAC CTGCTGCTCA ACGAACAGCA GGACAAGGCG ATTGACGCCT TCATCGAGGC GGTGCAGCAC GACCCGGGCA CCTCGGACCT GCATTTCGCG CTCGGCAACC TGTTCCGCCG ACGAGGCGAG TACGAGCGTG CGGTACGGGT CCATCAACAC CTGCTCGGTC GCGGTGACCT GCCCGCCACC GAACGCGAGC GTGCCCAGCA CGCGCTGGCG CAGGACTACG TGAAGGCCGG TCTGTTCGAC CGCGCCGAGG CGGCCTTCCG TGCACTCGAG GGCACGGCGT TTGCCACCGA TGCGCGCCTC GATCTGTTGA CGCTGCACGA GCGCTCGCGC GACTGGCATG CCGCCATCGA GACGGCTCGC AAGCTGGAGG CCGCGGGCGC CGGATCCTTC GCCAACCGCA TGGCGCACTA CGGCTGCGAG ATCGCGCTGG AAGCGGATGC ACGCCGCCGC CCCGATGAAG CCGAGGAAGC CCTGCGCAAG GCGCGTGAAG CTGCGCCGCA GGCACCGCGA CCGCGGGTCA TCGCCGGGCA GCGCCTTGCG CGCGCCGGCC AGCATCGCGA AGCGCTCTCG GCATGGGACG AACTTCTCGC CACGCAACCG AGCGCCTTCG CGCTGATCGC ATCGGACTAT GCCAACAGCG CGCTGGCCTG CGGTGACGCA GCCGACGCCC GCGGGCGGCT GGAAGCCGTT TACGACCGTG TTCCGAGCCT CGACATCGTG ACGGCGCTGC AACAGCTCGA ACCGGATCCG GCCGCGCGCC ACGAACGATT GCGCCGTCAC CTGCAGGCAC ACCCGACACT GTCGGCCGCA TCGGCCCTGC TGAAGGAGCA GCAGGCTCAA GGACTGGCCC CAACGTCGAC CGACGCCGAG CAGCTGCAGC AGATCACGGC CGCCGCCGCC CGGCCGATCC GGCGCTTCCG CTGCGCAGCC TGCGGCTTCG AGGCGCAGCA CTACTTCTGG CAATGCCCCG GGTGCCACAG CTGGGACAGC TATCCGCCCA CGCGGTTGGA GGACCAGTGA
|
Protein sequence | MDFDPLTTLP LALLGLMVAF ALGWLASRFD VRQWKREQQE SPKAYYKGLN LLLNEQQDKA IDAFIEAVQH DPGTSDLHFA LGNLFRRRGE YERAVRVHQH LLGRGDLPAT ERERAQHALA QDYVKAGLFD RAEAAFRALE GTAFATDARL DLLTLHERSR DWHAAIETAR KLEAAGAGSF ANRMAHYGCE IALEADARRR PDEAEEALRK AREAAPQAPR PRVIAGQRLA RAGQHREALS AWDELLATQP SAFALIASDY ANSALACGDA ADARGRLEAV YDRVPSLDIV TALQQLEPDP AARHERLRRH LQAHPTLSAA SALLKEQQAQ GLAPTSTDAE QLQQITAAAA RPIRRFRCAA CGFEAQHYFW QCPGCHSWDS YPPTRLEDQ
|
| |
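As a rough illustration of how the gene and protein sequences above relate, the following sketch translates a nucleotide string codon by codon and computes its GC fraction. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for internal codons (the two tables differ in which start codons are permitted), and it strips the spaces that separate the 10-base blocks in this record. The helper names `clean`, `gc_fraction` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove whitespace between sequence blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
fragment = "ATGGACTTTG ATCCGCTGAC CACGCTGCCG"
print(translate(fragment))  # MDFDPLTTLP
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.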