Gene Mpe_A0232 details

Gene Information

Locus tag: Mpe_A0232
Symbol:
ID: 4783940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 247783
End bp: 249090
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 74%
IMG OID: 640088783
Product: sodium:galactoside symporter family protein
Protein accession: YP_001019429
Protein GI: 124265425
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID:
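
The coordinates and lengths in this record are mutually consistent, which can be checked with a minimal Python sketch. It assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon. The function names are hypothetical and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(247783, 249090)
print(length, protein_length(length))  # prints: 1308 435
```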


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.626751
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGC GGGCCATGGC AGCGCTCGGC CTGCCGCCCG CCACAGCCGG GCTGGCTTCG 
TCCGGCGACG GCCTGCGCTA CGGCGCCCTC GGCCTGCCGC TGGCCTTCGT CGCACTGCCG
CTGTACGTGC TGCTGCCGAA CCACTACGCG GCGCAGTTCG GCGTGCCGCT GGCCGCGCTC
GGCGCCGTGC TGCTCGCCGC GCGCCTGCTG GACGCGCTGG CCGACCCCCT GATCGGCCGC
TGGGTGGACC GGCTGTTCGC GCGGAAGGTC CTGTCAGCGT GGTGGGCCGC CACGATCGCG
GCGCTGGTGC TCGCGACCGG CTTTCGCGCG CTGTTCTTTC CCGCCGTCGA AGGCACGGCG
GCCCTGCTCG CGTGGTGCGC GATCGGTCTG GTCTTCACCT ACCTGGGCTA CAGCGTGGTC
TCGGTGGTCC ACCAGGCCTG GGGGGCACGG CTCGGCGGCG ACGAGGCGGG CCGCGCCCGC
GTGGTCGCGT GGCGCGAAGG GGCCGCGGTG GTGGGCGTGC TGATCGCCAG CGTGCTGCCG
TCGGCGTCCG GCCTGCAGGC CACCACGCTG GTGTTCGCGG TGCTGCTGCT GGCCGGGCTG
GCGCTGCTGC GGCAGGCACC AAGGGCGGTG CTTCGCCCAC CTCTGGACGC CGGCGGCGCC
GCATCGGTGC AGCCGTTTCG CGTGACGGCG TTTCGACGCC TGCTGGCGAT CTTCATCGTC
AACGGCATCG CCAGCGCAGT ACCGGCCACG CTGGTGCTGT TCTTCATCCG CGACCGGTTG
CAGGCACCGG CCTGGGAGCC TGCTTTCCTG GCAGCGTACT TCGCGGCCGG CGCGCTATCG
ATCCCGCTGT GGCTGCGCAG CGTCGCCCGC TTCGGCCTGG CGCGCAGTTG GCTCGCCGGC
ATGGGGCTGG CGATTGCCAC CTTCGGCTGG GCCGCGACGC TGGGCGCCGG CGACACGCTC
GGCTTCCTCG CGGTGTGCAT CGCCAGCGGC GCGGCGCTCG GCGCCGACCT CACGCTACCC
GGCGCGCTGC TGACCGGCGT GATCCAGCGT GCTGGCCACG CCGGCCACGG CGAGGGCGCC
TACCTCGGCT GGTGGAACTT CGCGACCAAG CTCAACCTCG CGCTCGCCGC GGGCGTGGCC
TTGCCGTTGC TGCAGGCCAC GGGCTACGAG ACCGGTGCGC GAGACCCCCA GGCGCTCGCC
GCGCTGAGCT TCGCCTACTG CCTGCTGCCG TGCGCGCTGA AGCTCGGCGC CGCGCTGCTG
CTGTGGGCGC TGTGGCTGCG CCACCCCGAC GCTGGAGATT TCGCATGA
 
Protein sequence
MSERAMAALG LPPATAGLAS SGDGLRYGAL GLPLAFVALP LYVLLPNHYA AQFGVPLAAL 
GAVLLAARLL DALADPLIGR WVDRLFARKV LSAWWAATIA ALVLATGFRA LFFPAVEGTA
ALLAWCAIGL VFTYLGYSVV SVVHQAWGAR LGGDEAGRAR VVAWREGAAV VGVLIASVLP
SASGLQATTL VFAVLLLAGL ALLRQAPRAV LRPPLDAGGA ASVQPFRVTA FRRLLAIFIV
NGIASAVPAT LVLFFIRDRL QAPAWEPAFL AAYFAAGALS IPLWLRSVAR FGLARSWLAG
MGLAIATFGW AATLGAGDTL GFLAVCIASG AALGADLTLP GALLTGVIQR AGHAGHGEGA
YLGWWNFATK LNLALAAGVA LPLLQATGYE TGARDPQALA ALSFAYCLLP CALKLGAALL
LWALWLRHPD AGDFA
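
The 10-character blocks above can be joined and translated to check that the gene sequence encodes the listed protein. The sketch below is an illustration only, not the database's own pipeline. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, used on the assumption that translation table 11 assigns the same amino acids to each codon.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, bases ordered T, C, A, G.
# Assumption: translation table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final line of the gene sequence in this record.
head = clean("ATGAGCGAGC GGGCCATGGC AGCGCTCGGC")
tail = clean("CTGTGGGCGC TGTGGCTGCG CCACCCCGAC GCTGGAGATT TCGCATGA")
print(translate(head))  # prints: MSERAMAALG
print(translate(tail))  # prints: LWALWLRHPDAGDFA
```

Passing the full gene sequence to `clean` and then to `gc_percent` or `translate` would allow the same comparison against the GC content and protein sequence recorded here.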