Gene Mpe_A2247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2247 
Symbol 
ID4785379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2406407 
End bp2407576 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content69% 
IMG OID640090815 
Producttetratricopeptide repeat protein 
Protein accessionYP_001021438 
Protein GI124267434 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTTTG ATCCGCTGAC CACGCTGCCG CTGGCCCTGC TCGGCCTGAT GGTGGCCTTC 
GCGCTCGGCT GGCTGGCTTC GCGCTTCGAC GTCCGCCAGT GGAAGCGCGA GCAACAGGAA
TCGCCGAAGG CCTACTACAA GGGCTTGAAC CTGCTGCTCA ACGAACAGCA GGACAAGGCG
ATTGACGCCT TCATCGAGGC GGTGCAGCAC GACCCGGGCA CCTCGGACCT GCATTTCGCG
CTCGGCAACC TGTTCCGCCG ACGAGGCGAG TACGAGCGTG CGGTACGGGT CCATCAACAC
CTGCTCGGTC GCGGTGACCT GCCCGCCACC GAACGCGAGC GTGCCCAGCA CGCGCTGGCG
CAGGACTACG TGAAGGCCGG TCTGTTCGAC CGCGCCGAGG CGGCCTTCCG TGCACTCGAG
GGCACGGCGT TTGCCACCGA TGCGCGCCTC GATCTGTTGA CGCTGCACGA GCGCTCGCGC
GACTGGCATG CCGCCATCGA GACGGCTCGC AAGCTGGAGG CCGCGGGCGC CGGATCCTTC
GCCAACCGCA TGGCGCACTA CGGCTGCGAG ATCGCGCTGG AAGCGGATGC ACGCCGCCGC
CCCGATGAAG CCGAGGAAGC CCTGCGCAAG GCGCGTGAAG CTGCGCCGCA GGCACCGCGA
CCGCGGGTCA TCGCCGGGCA GCGCCTTGCG CGCGCCGGCC AGCATCGCGA AGCGCTCTCG
GCATGGGACG AACTTCTCGC CACGCAACCG AGCGCCTTCG CGCTGATCGC ATCGGACTAT
GCCAACAGCG CGCTGGCCTG CGGTGACGCA GCCGACGCCC GCGGGCGGCT GGAAGCCGTT
TACGACCGTG TTCCGAGCCT CGACATCGTG ACGGCGCTGC AACAGCTCGA ACCGGATCCG
GCCGCGCGCC ACGAACGATT GCGCCGTCAC CTGCAGGCAC ACCCGACACT GTCGGCCGCA
TCGGCCCTGC TGAAGGAGCA GCAGGCTCAA GGACTGGCCC CAACGTCGAC CGACGCCGAG
CAGCTGCAGC AGATCACGGC CGCCGCCGCC CGGCCGATCC GGCGCTTCCG CTGCGCAGCC
TGCGGCTTCG AGGCGCAGCA CTACTTCTGG CAATGCCCCG GGTGCCACAG CTGGGACAGC
TATCCGCCCA CGCGGTTGGA GGACCAGTGA
 
Protein sequence
MDFDPLTTLP LALLGLMVAF ALGWLASRFD VRQWKREQQE SPKAYYKGLN LLLNEQQDKA 
IDAFIEAVQH DPGTSDLHFA LGNLFRRRGE YERAVRVHQH LLGRGDLPAT ERERAQHALA
QDYVKAGLFD RAEAAFRALE GTAFATDARL DLLTLHERSR DWHAAIETAR KLEAAGAGSF
ANRMAHYGCE IALEADARRR PDEAEEALRK AREAAPQAPR PRVIAGQRLA RAGQHREALS
AWDELLATQP SAFALIASDY ANSALACGDA ADARGRLEAV YDRVPSLDIV TALQQLEPDP
AARHERLRRH LQAHPTLSAA SALLKEQQAQ GLAPTSTDAE QLQQITAAAA RPIRRFRCAA
CGFEAQHYFW QCPGCHSWDS YPPTRLEDQ