Gene M446_0871 details


Gene Information

Locus tag: M446_0871
Symbol:
ID: 6129296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium sp. 4-46
Kingdom: Bacteria
Replicon accession: NC_010511
Strand:
Start bp: 978542
End bp: 979792
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 65%
IMG OID: 641641182
Product: major facilitator transporter
Protein accession: YP_001767856
Protein GI: 170739201
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
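
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(978542, 979792)
print(length)                  # 1251
print(protein_length(length))  # 416
```

Both values agree with the Gene Length and Protein Length fields of this record.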


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACACCC ACGCCCGAAT GACCAGCGCG CCACCGCCCA TCAGCCGATC GAACACAATG 
TCCGCCGGAC TAACTGTCCT GTTCGCATGC GCCGTTGGCG TCGTCGTACT GAGCCTGTAT
GCGTCGCAGC CCCTGATCGG CTTGATCGGA TCGTCATTTT CGTTAAGCAC ATCCGAAGCC
AGTCTTGTAT CGACACTTAC TCTGCTCGGC TACGCGAGCG GCCTGTTCCT GCTCGTCCCA
CTGACGGATC TCGTCGAGAA CCGGACGGTG ATCGTTGTGA CTTTGCTGGT TGACGTTGCT
GCCCTGGCGG CGATCGCGCT TGCCCCGACG CCATTTCTCT TCCTACTGGC GTCGTACGTC
GCCGGTGTGA CCACCAGCGC GATTCAAATG CTCGTACCGG TGGCGGCTCA ACTCTCTCCT
GAAGCACATC GCGGTCGCGT CGTCGGCAAC GTGATGAGCG GCCTCATGCT CGGCATCCTT
CTTTCGCGGC CAGCCGCGAG TTGGGTTGCG GAGTTCGTCG GTTGGCGCTG GTTCTACGGT
GGGCTCTCCT TGATCATCGC AGCCCTGAGC ATGATCCTGG CCACCGTCTT ACCGGTCCGC
AAGCCCGCGA CCGGCACGAA TTATTCGGCG CTCATCGGAT CGATGCTGAC GATCCTGCGG
GAGGAGCCCG TCCTTCGGCG CCGGGCCAGT TACCAGGCCC TGTGCATGGG CGCATTCGGC
GTGTTCTGGA CGTCGGTCGC CTTGCGCCTT TCCGATCAGC CATTCTCCCT CGGTCAGACC
GGAATCGGCC TTTTCGCCCT GGCCGGAGCC GCCGGGGCGG TCGTCGCCCC GATCGCCGGG
CGAGCGGGCG ACCGGGGCTG GACGCGCAGC GCAACGCGAC TCGCCCACCT CGCGGTCATC
GCCGCGATGA TCCTCGCCGG CATCGGCGGC GACGTCCTGG TCGGCACGCC TTTCGCCCCG
AGCTGGGCTC CTCTCGCGAT CCTGGTCGCC AGCGCCGTCC TGCTCGATCT GGGGGTGATC
GGCGATCAGA CGCTGGGACG GCGCGCAATC AACCTGTTGC GCCCCGAAGC AAGAGGCCGC
GTGAACGGGC TGTTCACAGG CCTGTTCTTC CTCGGCGCCG CGGCCGGGTC GGCCTTATCC
GGACTGGCAT GGGTCAGCTT CGGCTGGTTG GGTGTCTGCT CAGTCGGACT CGCGTTCGGA
TGCGTCACCC TCGTGCTTTC ATCGTCTCAG CCGCAGGCAG CGGCGCGCTG A
 
Protein sequence
MHTHARMTSA PPPISRSNTM SAGLTVLFAC AVGVVVLSLY ASQPLIGLIG SSFSLSTSEA 
SLVSTLTLLG YASGLFLLVP LTDLVENRTV IVVTLLVDVA ALAAIALAPT PFLFLLASYV
AGVTTSAIQM LVPVAAQLSP EAHRGRVVGN VMSGLMLGIL LSRPAASWVA EFVGWRWFYG
GLSLIIAALS MILATVLPVR KPATGTNYSA LIGSMLTILR EEPVLRRRAS YQALCMGAFG
VFWTSVALRL SDQPFSLGQT GIGLFALAGA AGAVVAPIAG RAGDRGWTRS ATRLAHLAVI
AAMILAGIGG DVLVGTPFAP SWAPLAILVA SAVLLDLGVI GDQTLGRRAI NLLRPEARGR
VNGLFTGLFF LGAAAGSALS GLAWVSFGWL GVCSVGLAFG CVTLVLSSSQ PQAAAR
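
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, its GC fraction computed, and the DNA translated to protein. It assumes an unambiguous A/C/G/T alphabet and uses the amino-acid assignments of translation table 11, which match the standard code; the function names are hypothetical. Alternative start codons, which table 11 permits, are not handled, and this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCACACCC ACGCCCGAAT GACCAGCGCG"
print(translate(clean_sequence(first_blocks)))  # MHTHARMTSA
```

The printed peptide matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.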