Gene Mext_4685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4685 
Symbol 
ID5834236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5237784 
End bp5239139 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content66% 
IMG OID641370480 
Productgeneral substrate transporter 
Protein accessionYP_001642124 
Protein GI163854081 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0475747 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.909787 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGACA CCAGCGCCGC CGCGCCGCCG CTTGACGAGG CCGCCGACCG CCGCCGCCGC 
ATCATGGCCA TCGTCGGTTC CTCGTCGGGC AATCTCGTCG AGTGGTACGA CTTCTACTGC
TACGCCTTCT TCGCTCTCTA CTTCGCCCCC GTCTTCTTCC CGGAGGGTGA CGACACGGGC
CAGCTCCTGA AGTCGGCCGC GGTCTTCGCG GTGGGCTTCT TCATGCGGCC GATCGGCGGC
TGGCTGTTCG GTCGCATCGC CGACCGTCTC GGGCGCAAGA CCTCGCTGAT GATCTCGGTG
CTGATGATGT GCGGCGGCTC GCTCGCCATC GCCCTGCTGC CGACCTACGC GACCGTCGGC
CATCTCGCGC CGGTTCTCCT CGTGATCGCG CGCATGGTGC AGGGCCTCTC GGTCGGCGGC
GAGTACGGCA CCAGCGCGAC CTACATGAGC GAGGTCGCGA CCAAGGGGCA GCGCGGGTTC
TTCGCCTCGT TCCAGTACGT CACGCTGATC GGCGGCCAGT TGCTCGCTTC GCTGGTGCTC
GTCGTGCTCC AGAGCGTGCT CACAGCCGAG CAACTCACGG CTTGGGGCTG GCGCATCCCC
TTCGTGATCG GCGCGCTGGC CGCGGTGGTT GCCCTGTTCC TGCGCCGCTC GCTCTCCGAG
ACGATGAGCG CGGAGAACAA GGATTCGAAG GAGGCCGGAA CCCTCGCCGG CCTGCTCAAG
CACTGGCGGG CCTTCGCCGT GGTGCTGGCC TATACGGCCG GCGGGTCGCT GTCGTTCTAC
ACGCTCACGA CCTACATGCC GAAATACCTC TTCAACACCG CCCATATAGA CAAGGTGACT
GCGTCGCAGA TCACGACGGT GGCACTGTTC GCCTACATGG CGATCCAGCC CTTCTTCGGC
TGGCTCTCCG ACCGGATCGG CCGGAAGACG AACATGCTGC TGTTCAGCGG CCTCGGCATG
GTGATGATCG TGCCGTTGAT GACGGCCATC GGCACCACCA CCGACCCCGT CCTGTCCTTC
GGCCTGATCA TGACCGGGCT CGTCGTGATC AGCTTCTACA CCGGCATCAG CGGCATCGTG
AAAGCGGAGC TGTTCCCGAC TCAGGTGCGG GCGCTCGGCG TCGGCCTGTC CTACGCGGTG
GCCAACTCAC TGTTCGGCGG CACGGCGGAG GCGGTCGCGC TCTGGCTCAA GCATGTCGGC
GCGGAGACCA GCTTCTTCTG GTACGTCGCC GTGATGCTGG CGATCTCGTT CATCGCTTCA
CTGGTGATGC CGAACCCGAA GCGCCACGGC TATCTCGACG GGGACGGCAC CGTCGAGGAG
GCTTTGGGGC GCAAGACGAG CCCGGTTCTG GCCTGA
 
Protein sequence
MLDTSAAAPP LDEAADRRRR IMAIVGSSSG NLVEWYDFYC YAFFALYFAP VFFPEGDDTG 
QLLKSAAVFA VGFFMRPIGG WLFGRIADRL GRKTSLMISV LMMCGGSLAI ALLPTYATVG
HLAPVLLVIA RMVQGLSVGG EYGTSATYMS EVATKGQRGF FASFQYVTLI GGQLLASLVL
VVLQSVLTAE QLTAWGWRIP FVIGALAAVV ALFLRRSLSE TMSAENKDSK EAGTLAGLLK
HWRAFAVVLA YTAGGSLSFY TLTTYMPKYL FNTAHIDKVT ASQITTVALF AYMAIQPFFG
WLSDRIGRKT NMLLFSGLGM VMIVPLMTAI GTTTDPVLSF GLIMTGLVVI SFYTGISGIV
KAELFPTQVR ALGVGLSYAV ANSLFGGTAE AVALWLKHVG AETSFFWYVA VMLAISFIAS
LVMPNPKRHG YLDGDGTVEE ALGRKTSPVL A