Gene Mext_4403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4403 
Symbol 
ID5830937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4898413 
End bp4900392 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content66% 
IMG OID641370196 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_001641842 
Protein GI163853799 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATC TCAACGACCG GCGCCCTCTC GGTCGACGCG GCCCGAGGAT GCAGGTTAAT 
CCGCCGGTCT TCTTCACCTC GGCCGGGCTG ACGCTCGCCT TCGCCGGTCT CAGCGCGCTG
TTTCCCGCAC AGGCCAAGTC GATCTTCGAC AGCCTACAAG CCGCGATCGT GCACGAATTC
GGCTGGTTCT ACATCGCCGT CGTCGCCGGC TTTCTCGGCT TCGCGATCTT CCTGATGCTG
AGCCGCTACG GCGACGTGAA GCTGGGGCCG GACGACAGCG AGCCCGACTA CAGCTACCTG
TCGTGGTTCG CGATGCTGTT CAGCGCCGGC ATGGGCATCG GCCTGATCTT CTTCGGTGTG
GCCGAGCCGC TTCAGCACTA TGCCACGCCC CCCGTCGGCG AGGGCAAGAC CATCGAGGCT
GCGCAGCGGG CGATGGTGCT GACCTTCTTC CACTGGGGGG TGCATGCCTG GGCGATTTAC
ATCGTCGTGG GGCTGGCGCT GGCCTATTTC GCATTCCGCC GCGGGCTGCC GCTGACGGTG
CGCTCGGCCC TCCACCCTCT CATCGGCGAC CGGATCAACG GTCCGATCGG GCACGCGATC
GACATCTTCG CGGTGCTCGG CACGATCTTT GGCGTCGCGA CCTCGCTCGG GCTCGGCGTG
CTTCAGGTCA ATGCCGGTTT CACCCACCTG TTCGGCGTAC CGAACAACAC CTTCGTACAG
ATCATCCTGA TCGCCGCCAT TACCGGCTGC GCCACGCTCT CGGTCGCCTC GGGGCTCGAC
AAGGGCGTGA AGGTCCTGTC CGAACTGAAC ATCATCCTGG CCGTCGTGCT GCTCGCCTTC
GTGCTGATCA CCGGCTCGAC GGTGTTCCTG CTCCAGGCCT TCGTTCAGAA TCTCGGCGCT
TATCTCGGGG CTGTGGTCGA GCGCACGCTC CAGACCTACG CCTACAAGCC CAATGAATGG
CTCGGCAGTT GGACGCTGTT CTACTGGGGC TGGTGGATCG CGTGGTCGCC CTTCGTCGGC
ATGTTCATCG CCCGGATCTC GCGCGGGCGC ACCATCCGCG AATTCGTCAC CGGCGTGCTG
CTGGTGCCGG TGCTGTTCAC GTTCTTCTGG ATGACCGTGT TCGGTAACAC GGCCATCGAG
ATGGATCGCG CCGGCGCAGT GCCGCTCGCG CAGATCGTCA AGGACAACAT GCCGGTCGCC
CTGTTCGAGA TGCTCGGGCA CCTGCCGTTC GGCATGATCG CCTCGGGACT GGCGACGCTG
CTCGTGATCT TCTTCTTCGT CACCTCGGCG GATTCCGGAG CGCTGGTGAT CGACATGATC
ACCTCGGGCG CGGCCGACAA CCCGCCGCTC TGGCAGCGGG TGTTCTGGGC GGTCAGCAGC
GGCGCCATCG CCGCCGTGCT GCTGGTGGCG GGCGGGCTTG AGGCGCTCCA GACGGCGGCG
ATCGCCAGCG CCCTGCCCTT CTCCATCGTG ATGATCTTCA TCTGCTACGG CCTGCTCCGG
GCCCTCCAAC TGGAGGGGCG TGCCGGCGGC CTCGACCTGT CCTCGGCCGC CGCCGGGCCG
TCGGGCGGCC TGTCCTGGCA GGAGCGGCTG GCGGCGATCA CCCACTCCTA CCGGAAGGAA
GACCTCAAGA CCTTCCTCGA CGAGACGGTC GCGCCCGCCC TCGAAGCGGT GGCCGGGCAG
ATGCGCGAGA GCGGCTTGTC GCCCGAGGTC GTGCGGTCGG CCGAGCGCAT CGATCTGATC
GTTCCCTACG GGGACCGTGG CGCCTTCCGC TACGGCATCC GCGTGCGCGG ATTGCGGAAT
CCGAGCTTCG CCTGGGCCGA GAATCCGTCG AAGTCGTCGG ATGAGCGCCA TTATCGCGCC
CTCGTTCAGA CCTCCGAGGG CGACCGCCCG CACGACGTCA CCGGCTACGG CCGGGATCGA
GTCATCGACG ATCTCCTGAA GCGCTATGCA CAGTTCCGGG CGGTCCGGGG CGCTGCCTGA
 
Protein sequence
MNDLNDRRPL GRRGPRMQVN PPVFFTSAGL TLAFAGLSAL FPAQAKSIFD SLQAAIVHEF 
GWFYIAVVAG FLGFAIFLML SRYGDVKLGP DDSEPDYSYL SWFAMLFSAG MGIGLIFFGV
AEPLQHYATP PVGEGKTIEA AQRAMVLTFF HWGVHAWAIY IVVGLALAYF AFRRGLPLTV
RSALHPLIGD RINGPIGHAI DIFAVLGTIF GVATSLGLGV LQVNAGFTHL FGVPNNTFVQ
IILIAAITGC ATLSVASGLD KGVKVLSELN IILAVVLLAF VLITGSTVFL LQAFVQNLGA
YLGAVVERTL QTYAYKPNEW LGSWTLFYWG WWIAWSPFVG MFIARISRGR TIREFVTGVL
LVPVLFTFFW MTVFGNTAIE MDRAGAVPLA QIVKDNMPVA LFEMLGHLPF GMIASGLATL
LVIFFFVTSA DSGALVIDMI TSGAADNPPL WQRVFWAVSS GAIAAVLLVA GGLEALQTAA
IASALPFSIV MIFICYGLLR ALQLEGRAGG LDLSSAAAGP SGGLSWQERL AAITHSYRKE
DLKTFLDETV APALEAVAGQ MRESGLSPEV VRSAERIDLI VPYGDRGAFR YGIRVRGLRN
PSFAWAENPS KSSDERHYRA LVQTSEGDRP HDVTGYGRDR VIDDLLKRYA QFRAVRGAA