Gene Mext_0485 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0485 
Symbol 
ID5831294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp529146 
End bp530366 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content68% 
IMG OID641366264 
Producthypothetical protein 
Protein accessionYP_001637973 
Protein GI163849930 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.626037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGA CCCACTCCTC TCCCGCCCAG CCCAGTCGCG CGACCCGGCC CGTGGTCGCC 
AGCGGCCTCG TCGCACTTGC AATGGCGATG GGGGTCGGCC GTTTCGCCTT TACGCCGCTG
ATGCCGCTGA TGATCCGTGA CGGTACGTTG GACGCTACCA CCGGTACGGA ATGGGCGGCA
GTCAACTATG TCGGGTATTT CGTGGGTGCC CTGACCGCCT CGTGGTTCAG CGGCAACCCG
CGTCGCGGTC TGCTGCTGAG CCTGATCGGT GTCGCTCTCA CGACACTGGC GATGGTGGCA
GTTGATGCCG TTCCCACCAC CCTGCTCGGG GTCATGCTGC GCGGGGCAGC TGGCGTGTTC
AGCGCCTGGG CGTTGGTGTG CACGAGCAGC TGGTGCCTGG CCGAACTTGC CCGGCGTCGG
GCCGGGCAAC TGGGCGCGTG GATCTACACG GGTGTCGGTC TCGGCATCGC GTTAGCCGGT
GTGCTGGCTT GGCTTGGCGG ACGCCAGCGG GCGGACTGGC TCTGGCTTGA ACTAGGGCTC
ATCGCCAGTG CCGGGGCGGT GCTCGTTTGG ACGCAGTCAC GGGGGCAAAG CACGATCCCG
GCCGAGATCG AAGAACGCGA GGCTACAGCA ATCGCTCCGA CGCGAGGAAG CGGGCAGTTG
GCTCTGGTGC TCTGCTACGG AATCTTCGGG TTCGGCTACA TCGTGCCGGC CACGTTCCTG
CCGGCCATGG CGCGCGAGCT AGCTCCCGAT CCCCTGGTGT TCGGGTTGAC TTGGCCCTTG
TTCGGCCTCG CCGCCGCTCT GTCGGTCGCG GCCGTGGCCC ACTGGCTGCC AAGCACATCG
CGTCAACGAC TGTGGGCTCT GTCACAGGGC GTCATGGCGC TCGGCACCGC CCTGCCGCTG
TTCGTCCAAG CTCTCTGGGC TGTCGCGGCC TCAGCGGTCT TGGTCGGCGG CACGTTCATG
GTAGCGACCA TGGCCGGCTT GCAGCTCGCC CGCGAGGCGC AGCCGGACAA TCCGACCCCG
CTCCTTGCGC GAATGACCGC TGCCTTCGCC GCCGGACAGA TCGCTGGCCC ATTGCTGGTT
CGCGCGCTTG GTTCCGGCCG CTGGGCCGGC TGGGATGCGC TGGGGTGGAC GGGCGCTCTC
GCTACGCTGC TGCTAGTGCT GACGGCAATA TGGCTCTGGC GCAGCACCAA ACCTTCCCTC
GAAAGCCTGA GGCCCGTCTG A
 
Protein sequence
MSTTHSSPAQ PSRATRPVVA SGLVALAMAM GVGRFAFTPL MPLMIRDGTL DATTGTEWAA 
VNYVGYFVGA LTASWFSGNP RRGLLLSLIG VALTTLAMVA VDAVPTTLLG VMLRGAAGVF
SAWALVCTSS WCLAELARRR AGQLGAWIYT GVGLGIALAG VLAWLGGRQR ADWLWLELGL
IASAGAVLVW TQSRGQSTIP AEIEEREATA IAPTRGSGQL ALVLCYGIFG FGYIVPATFL
PAMARELAPD PLVFGLTWPL FGLAAALSVA AVAHWLPSTS RQRLWALSQG VMALGTALPL
FVQALWAVAA SAVLVGGTFM VATMAGLQLA REAQPDNPTP LLARMTAAFA AGQIAGPLLV
RALGSGRWAG WDALGWTGAL ATLLLVLTAI WLWRSTKPSL ESLRPV