Gene Ent638_3880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3880 
Symbol 
ID5110607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4189885 
End bp4191114 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content61% 
IMG OID640494089 
Productmajor facilitator superfamily transporter 
Protein accessionYP_001178586 
Protein GI146313512 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.201886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGTAC CTGCCGCCGA ACCGGCACTT AGCGGATTGC GCCTCAATCT GCGCATCGTT 
TCTGTTGTCA TGTTTAATTT CGCCAGCTAT CTCACCATCG GTCTGCCGCT GGCGGTTTTG
CCCGGTTACG TCCACGACGT GATGGGATAC AGCGCCTTCT GGGCGGGGCT GGTGATCAGC
CTGCAATATT TCGCCACGCT GTTGAGCCGT CCACAGTCGG GGCGATATGC CGACCTGTTT
GGGCCAAAAA GTATTGTCGT CGTCGGGTTA TGCGGCTGTT TCATGAGTGG CCTGAGCTAT
TTGCTGGCGG CCACCACCAG CGGCTGGCCG ATAATGAGTC TGGCGCTGCT GTGCCTCGGG
CGCGTGATTC TGGGGATAGG GCAAAGCCTG GCGGGAACCG GCTCAACGCT GTGGGGCGTG
GGCGTCGTCG GTAACGCGCA TATTGGCCGC GTGATTTCGT GGAACGGGAT TGTCACCTAC
GGCGCGATGG CGCTCGGCGC ACCGCTTGGC GTGCTGTGCT ATGCGTGGGG CGGGCTTCAC
GGGCTGGCAT TAACCATTAT GACCGTCGCG TTTATTGCTA TTTTGCTGGC GATTCCACGA
CCGAAAGTCA AAGCCAGCAA AGGCAAGCCG CTGCCGTTTC GCGCCGTACT CGGGCGCGTC
TGGCCGTATG GCATGGCGCT GGCGCTGGCC TCCGCCGGTT TTGGCGTGAT TGCCACCTTT
ATCACGCTGT TCTACGACGC CAAAGGCTGG GACGGTGCGG CGTTTGCGCT GACGCTGTTT
AGCTGCGCGT TTGTCGGAGC GCGTCTGCTG TTCCCGAACG GCATCAACCG TCTGGGCGGC
CTGAACGTGG CGATGATTTG CTTCGCCGTT GAGATTATCG GCCTGCTACT GACCGGGATC
GCCGACGAGC CGTGGGTAGC CAAAATCGGT GTGTTCCTCG CGGGGGCAGG GTTTTCACTG
GTCTTTCCGG CGCTGGGCGT GGTGGCGGTT AAAGCGGTGC CGCAGCAAAA TCAGGGATCG
GCGCTGGCGA CCTATACGGT GTTTATGGAT ATGTCGCTCG GCGTAACCGG GCCGCTGGCC
GGGCTGTTGA TGGCGTGGGC GGGTGTACCG GTGATTTATC TGGCGGCGGC GGGGCTGGTG
GGTGCGGCAT TACTGCTGAC GTGGCGCTTA AAAAAACGGC CCCCGACGCA AACGCCGGAG
GCCGTTAAAT CCTCATCAAA CGTTACTTGA
 
Protein sequence
MPVPAAEPAL SGLRLNLRIV SVVMFNFASY LTIGLPLAVL PGYVHDVMGY SAFWAGLVIS 
LQYFATLLSR PQSGRYADLF GPKSIVVVGL CGCFMSGLSY LLAATTSGWP IMSLALLCLG
RVILGIGQSL AGTGSTLWGV GVVGNAHIGR VISWNGIVTY GAMALGAPLG VLCYAWGGLH
GLALTIMTVA FIAILLAIPR PKVKASKGKP LPFRAVLGRV WPYGMALALA SAGFGVIATF
ITLFYDAKGW DGAAFALTLF SCAFVGARLL FPNGINRLGG LNVAMICFAV EIIGLLLTGI
ADEPWVAKIG VFLAGAGFSL VFPALGVVAV KAVPQQNQGS ALATYTVFMD MSLGVTGPLA
GLLMAWAGVP VIYLAAAGLV GAALLLTWRL KKRPPTQTPE AVKSSSNVT