Gene Mext_2094 details


Gene Information

Locus tag: Mext_2094
Symbol:
ID: 5833184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 2337184
End bp: 2339070
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 70%
IMG OID: 641367892
Product: TRAP C4-dicarboxylate transport system permease DctM subunit
Protein accession: YP_001639561
Protein GI: 163851518
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1593] TRAP-type C4-dicarboxylate transport system, large permease component; [COG3090] TRAP-type C4-dicarboxylate transport system, small permease component
TIGRFAM ID: [TIGR00786] TRAP transporter, DctM subunit
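
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of amino acids, assuming one trailing stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2337184, 2339070)
print(length, "bp")                      # compare with the Gene Length field
print(protein_length_aa(length), "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (1887 bp) and Protein Length (628 aa) fields.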


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.386051
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0954493
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACTCG ACGTGCAACT CGACGCGCCC GCGCCGGCCG CCCCCGTGCC CGCCGCCGGC 
CAAAGCTCGT ATCTCCAGAC CTTCGATCGC GTCTTCGGGC GGATGGTCGA GATCCCCGCG
GCGATCCTCG TCGTTCTGGA GATCGTCGTG CTCCTGAGCG GCGTCGTCAG CCGCTACGCC
CTGCACCGGC CTCTGGTCTG GTCCGACGAA CTCGCCTCGA TCCTGTTCCT GTGGCTCGCC
ATGCTGGGGA CGGTCGTCGC CTTCCGGCGC GGCGAACACA TGCGCATGAC CGCCTTCGTC
GGCATGGCCA CGCCGCGTGC GCGGGCCTTC ATGGACACCC TGGCGATCGT GGCCGGCCTC
GTCTTCGTCG GGGTTCTGCT GGAGCCGGCC TTCGAGTTCG CGATCGAGGA ATCCTGGGTC
TACACGCCGG CCCTCGACCT CCCCAACACC TGGCGCGCCG CCGCCCTGCC GGTCGGGTTC
GCGCTGATGC TCGTCACCGC CGTGCTGCGC CTCGTGCAGG AGAGCGACTG GCGCCTCACG
GGCGCGGCTT TGGCTCTCGG GGCCGGCCTC GCCGCCGTGC TCGCCGGCCT GTCGCCGCTC
CTGAGCAGCA TCGGCAACCT CAACCTGCTG GTGTTCTTCC TGCTTGGTGT CGGCGCCCTG
GTGTTCTCCG GCGTGCCGAT CGCCTTCGCC TTCGGGCTCT CGACCTACGC CTACCTCATG
AGCACGACCT ACGCGCCCGC CATGGTCGTC GTCGGGCGCA TGGACGAGGG CATGAGCCAT
CTCATCCTGC TGGCGGTGCC GCTGTTCATC TTCCTCGGCC TCCTGATCGA GATGACCGGC
ATGGCCCGCG CCATGGTCGG TTTCCTGGCA AGCCTGTTGG GCCATGTGCG CGGCGGCCTG
CACTACGTGC TGGTGGGGGC GATGTACCTC GTCTCCGGCA TCTCCGGCTC GAAGGCCGCC
GACATGGCCG CCGTCGCCCC GGTGCTGTTT CCCGAGATGA AGGCGCGCGG CGCCAAGGAG
GGCGATCTCG TCGCGCTGCT CTCCGCGACC GGCGCCCAGA CCGAGACGAT CCCACCGTCG
ATCGTGCTCA TCACCATCGG CTCGGTGAGC GGCGTCTCGA TCACCGCCCT GTTCACCGGC
GGGCTGCTGC CGGCCGTCGT CCTCGGCGCG ACGCTGTGCG CGCTGGTCTG GTGGCGCTAC
CGCGGCGAGG ACATGAGCCA CGTCGTCCGC CCCGGTCGGA AGGCGATCCT GAAGGCGCTC
GTCGTGGCGG TGCCGGCGCT CGCGCTGCCC TTCGTCATCC GGGCCGCGGT GATCGAGGGT
GTGGCCACCG CCACCGAGGT CTCGACGATC GGCATCGTCT ACGCCGTGCT GGCGGGGCTC
CTCGTCTACC GGCAGTTCGA CTGGCGCCGC CTCATGCCCA TGCTGGTGAC GACGGCGACG
CTGTCGGGGG CGATCCTGCT GATCATCGGC GCGGCCACCG CCATGGCCTG GGCGCTGACG
CAGTCCGGCT TCTCGCAGGC GCTCGCCGTC GCGATGAAGG AACTGCCCGG CGGCGCGCTC
GGCTTCCTCG TGGTCTCCGC CCTCGCCTTC ATCCTGCTGG GTTCGGTGCT GGAGGGAATC
CCGGCGATCG TCCTGTTCGG GCCGCTGATG TTTCCCATCG CCCGGGCCGT GGGCGTGCAC
GAGGTGCACT ACGCCATGGT CGTGGTTCTC GCGATGGAAA TCGGCCTGTT CGCCCCGCCC
TTCGGCGTCG GCTACTACGC GGCCTGCGCC ATCAGCCGCA TCCATCCCGA TGCCGGCATG
CGGCCGATCA TCGGCTACGT CGCGGCGCTG CTGGTCGGGC TGATCGTGAT CATCCTGGTG
CCCTGGTTCT CGATCGGCTT CCTCTAA
 
Protein sequence
MQLDVQLDAP APAAPVPAAG QSSYLQTFDR VFGRMVEIPA AILVVLEIVV LLSGVVSRYA 
LHRPLVWSDE LASILFLWLA MLGTVVAFRR GEHMRMTAFV GMATPRARAF MDTLAIVAGL
VFVGVLLEPA FEFAIEESWV YTPALDLPNT WRAAALPVGF ALMLVTAVLR LVQESDWRLT
GAALALGAGL AAVLAGLSPL LSSIGNLNLL VFFLLGVGAL VFSGVPIAFA FGLSTYAYLM
STTYAPAMVV VGRMDEGMSH LILLAVPLFI FLGLLIEMTG MARAMVGFLA SLLGHVRGGL
HYVLVGAMYL VSGISGSKAA DMAAVAPVLF PEMKARGAKE GDLVALLSAT GAQTETIPPS
IVLITIGSVS GVSITALFTG GLLPAVVLGA TLCALVWWRY RGEDMSHVVR PGRKAILKAL
VVAVPALALP FVIRAAVIEG VATATEVSTI GIVYAVLAGL LVYRQFDWRR LMPMLVTTAT
LSGAILLIIG AATAMAWALT QSGFSQALAV AMKELPGGAL GFLVVSALAF ILLGSVLEGI
PAIVLFGPLM FPIARAVGVH EVHYAMVVVL AMEIGLFAPP FGVGYYAACA ISRIHPDAGM
RPIIGYVAAL LVGLIVIILV PWFSIGFL
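
The gene and protein sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that layout, compute GC content, and translate the gene for comparison with the listed protein. It is a plain-Python illustration with hypothetical helper names (`clean`, `gc_content`, `translate`). It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; it does not handle the alternative start codons that table 11 allows, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the usual TCAG ordering; amino acid assignments are the
# same for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above; trailing bases that do not
# form a complete codon are ignored.
print(translate("ATGCAACTCG ACGTGCAACT"))
```

Passing the full gene sequence to `translate` and the full protein sequence to `clean` gives two strings that can be compared directly, and `gc_content` on the full gene can be compared with the GC content field.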