Gene Phep_1458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1458 
Symbol 
ID8252559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1732490 
End bp1733707 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content44% 
IMG OID644935112 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003091734 
Protein GI255531362 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.288009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.592381 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGT TTTTTAGATT GTACTTAGAG GCCTACAGAG GGCTCTCTAC CCCGGCATGG 
ATGCTGGCAT TGGTGATGCT GATCAACAGG AGCGGAGCGA TGGTCATTCC CTTCTTAGGG
GTTTATATGG TCAACCACTT AAATTTTAGT ATAGAAGATA CGGGCACTGT ACTAAGCTGT
TTTGGTATTG GCGCTGTATC AGGCTCTTTT TTAGGTGGCT GGTTAACAGA TAAAGTGGGT
CATTTTAAAG TCCAGCTGTT TAGCCTGATC CTGACTGTGC CCATGTTTTT TCTGCTGCCG
GAACTGAATA CTGTTTTAAA GCTGGCCATT GGTGTGTTTA TACTCAGCAT TATTTCAGAG
ACCTTCAGGC CTGCAAACTC GGTTTCTATT GCTTATTATT CGAGGCCGGA TAACATTATC
CGTTCTTTTT CTTTAAACCG GATGGCGGTA AACCTTGGTT TTTCTATAGG TCCCGCCCTT
GGGGGCTTTC TGGCTGCAGT ATCGTATACC TTTTTATTTT ACGGAAATGC TGTTGCGGCG
TTTTTATCGG CTTTATTGTT CTTTATTTAC TTCCGCAACC GTAAGGGAAA TGAAAAGAAA
GCGGTTGTCC AGGAGAGTTT TACTGTTGAT CCTGGCACAA GCCGTTCTCC GTATAACGAC
GGGCTTTTTA TCGCTTTCAG TATGCTGAGC TGTATATATG CAATTTGTTT CTTTCAGCTG
CTGAGCACCC TGCCTTTGTA TTACCGCACA ATTTATAAAC TTACTGAAGC CGACATTGGG
ATTATTCTGG CTTTTAGTGG CATGGTGGTG TTTTTGTTTG AAATGCTCCT GGTACACATT
GCCGAGAAAA GAATGACCGC CAGGGCAGTT ATTGTATCGG GTGTATTGCT TTGCAGCCTG
TCGTTTTTTA TCCTCAATTT AACAAATGGC ATCTGGGTAC TGTACTTAGC TATGTTTGTG
CTTTGTATTT CCGAAATTCT GGCCATGCCC TTTATGTCTA CCATAACCCT GCAGCGTTCC
TCGTTAAAAA CCAGGGGCGC CTATATGGGC ATTAATGCTT TGTCTTTTTC TGCTGCACAT
GTGTTCTCGC CATTTGTGGG CACCAGGATA GCTGCTGCTT ATGGATTTGA AACCCTGTGG
TACGGTACTA CGTTGGTACT GTTGCTTACA GCTGCAGGGT TTTTGCTGGT CATGAAAAAA
ATGAAGTTAT CGGCATAA
 
Protein sequence
MKEFFRLYLE AYRGLSTPAW MLALVMLINR SGAMVIPFLG VYMVNHLNFS IEDTGTVLSC 
FGIGAVSGSF LGGWLTDKVG HFKVQLFSLI LTVPMFFLLP ELNTVLKLAI GVFILSIISE
TFRPANSVSI AYYSRPDNII RSFSLNRMAV NLGFSIGPAL GGFLAAVSYT FLFYGNAVAA
FLSALLFFIY FRNRKGNEKK AVVQESFTVD PGTSRSPYND GLFIAFSMLS CIYAICFFQL
LSTLPLYYRT IYKLTEADIG IILAFSGMVV FLFEMLLVHI AEKRMTARAV IVSGVLLCSL
SFFILNLTNG IWVLYLAMFV LCISEILAMP FMSTITLQRS SLKTRGAYMG INALSFSAAH
VFSPFVGTRI AAAYGFETLW YGTTLVLLLT AAGFLLVMKK MKLSA