Gene Ent638_2059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_2059 
Symbol 
ID5113475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp2233139 
End bp2234623 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content57% 
IMG OID640492247 
Productmajor facilitator transporter 
Protein accessionYP_001176786 
Protein GI146311712 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCGTC AGTGGTTAGC GTTAGTGATT ATCGTGCTGG TGTATATCCC GGTGGCGATT 
GACGCAACGG TGCTGCACGT CGCCGCGCCG ACATTGAGCA TGACGCTCGG AGCCAGCGGC
AACGAATTGC TGTGGATTAT TGATATTTAT TCGCTGGTGA TGGCCGGTAT GGTGTTGCCG
ATGGGCGCGC TGGGCGATCG CATCGGCTTT AAAAAACTGC TCATGCTCGG CAGTGTGTTA
TTTGGGTTGT CATCGCTGGC CGCCGCGTTT TCCCCAACGG CAGGCTGGCT TATCGCTGCG
CGTGCGTCGC TGGCGATTGG CGCCGCGATG ATTATTCCGG CAACGCTTGC GGGGATCCGC
ACGCTGTTTA CCGAACCTCG TCACCGTAAT ATTGCCCTCG GCGTGTGGGC AGCGGTGGGG
TCGGGCGGTG CGGCGTTTGG TCCGCTGGTT GGCGGCATTC TGCTGGAGCA TTTCTACTGG
GGTTCGGTCT TTTTGATCAA CGTCCCGATT GTGATTGTGG TTGTCGCCCT TGCGGCTCGC
CTCGTCCCGA AACAGCAGGG CCGTCCTGAG CAACCGCTGA ATATTAGCCA TGCGCTGATG
CTGATCGTCG CGATTTTGCT GCTGGTCTAC AGCGCCAAAA CCGCGTTGAA AGGGGCGCTT
TCGCCGTGGC TGGTTGCGGG GACATTACTG ACGGGCGCGG TGATGCTGTT TGTTTTTGTG
CGTATTCAGT TGCGCGCCAG CACGCCGATG ATCGACATGC GTCTGTTTTG CAATCGCATC
ATTTTAAGCG GCGTTGTGAT GGCGATGACG GCCATGATCG CGCTGGTGGG TTTTGAACTA
CTGATGGCGC AGGAGTTGCA GTTTGTCCAC GGTTTCACGC CGTTTGAAGC GGGCAAGTTT
ATGCTGCCGG TGATGGTCGC CAGCGGATTC AGTGGACCGA TTGCTGGCGT GCTGGTGGGA
CGACTGGGGT TGCGGATCGT GGCGGCGGGC GGCATGGGGT TGAGTGCGGT GAGCTTCATT
GGATTATCGA TGCTCGACTT CAGCACGCAG CAGTGGCAAG CGTGGAGCCT AATGGTGCTG
CTGGGCTTTA GCGCCGCCAG TGCGCTACTG GCATCGACTT CTGCCATCAT GGCTGCCGCG
CCTAAAGAGA AAGCGGCAGC AGCGGGTGCG ATTGAAACCA TGTCTTATGA GCTGGGCGCG
GGGTTAGGCA TCGCCATTTT TGGTCTGCTT TTAACCCGCA GCTTCTCGGC GTCGATTGTG
TTGCCGCAAG GCTTAAACAA CACGCTGGCG GAAAAAGCGT CGTCATCCAT TGGTGAAGCC
GTGAAAGTGG CGCAGGATTT GACGCCAACG CTGGCAGACT CGGTCATTGA ATCCGCAAAA
GCCGCGTTTA TCACCTCACA CAGCGTGGCA TTAGGAAGCG CAGGCGGGAT GTTGTTGATC
CTTGCGATCG GGATTTGGTT TAGCCTGGCG AAAGTGAAAA AGTAG
 
Protein sequence
MFRQWLALVI IVLVYIPVAI DATVLHVAAP TLSMTLGASG NELLWIIDIY SLVMAGMVLP 
MGALGDRIGF KKLLMLGSVL FGLSSLAAAF SPTAGWLIAA RASLAIGAAM IIPATLAGIR
TLFTEPRHRN IALGVWAAVG SGGAAFGPLV GGILLEHFYW GSVFLINVPI VIVVVALAAR
LVPKQQGRPE QPLNISHALM LIVAILLLVY SAKTALKGAL SPWLVAGTLL TGAVMLFVFV
RIQLRASTPM IDMRLFCNRI ILSGVVMAMT AMIALVGFEL LMAQELQFVH GFTPFEAGKF
MLPVMVASGF SGPIAGVLVG RLGLRIVAAG GMGLSAVSFI GLSMLDFSTQ QWQAWSLMVL
LGFSAASALL ASTSAIMAAA PKEKAAAAGA IETMSYELGA GLGIAIFGLL LTRSFSASIV
LPQGLNNTLA EKASSSIGEA VKVAQDLTPT LADSVIESAK AAFITSHSVA LGSAGGMLLI
LAIGIWFSLA KVKK