Gene Phep_3851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3851 
Symbol 
ID8254985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4620453 
End bp4621970 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content47% 
IMG OID644937515 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003094104 
Protein GI255533732 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAG ACAAAAAGAA TACACTTACC CTGGCCGCCG TTTGCCTTTC CTCATTAATG 
TTTGGATTGG AAATATCTAG TGTGCCGGTA TTACTGCCCA CACTCGAAAA ATTATTAAAA
GGCAATTTCA GTGATGTACA ATGGATCATG AATGCCTATA CCATTGCTTG CACTACAGTA
TTGATGGCTT CCGGTACGCT TGCAGACCGC TATGGCCGCA AACTTGTTTT TACAGCCGGG
CTTTTGCTTT TTGGTATAAC CTCTCTGCTT TGTGGGCTGG CCACGGCGAT GCCATTACTC
ATCGCAGGAC GTTTTTTACA AGGACTTGGC GGAGGCATTA TGCTGATCTG CCAGGTTGCG
ATTCTTTCTC ACCAGTTCCA GGACGGACGC GAACGCAGCC TGGCCTTTGC CGCCTGGGGG
ATTGTTTTCG GAGTAGGCTT GGGGTTCGGG CCTCTAATTG GCGGTATGAT CCTCACATTT
TTAAGCTGGA AATGGGTGTT TCTGGTACAT GTGCCACTCA GCTTGCTTAC ACTGGGACTT
GCATGGGCCG GGGTGCGGGA ATCGCGGTCG GCTCAAATAC AGCGATTGGA TACCTTAGGG
ATGGTCACTC TTTCCTTAGC TGTTTTTGGA CTGATTTATT ACATTACCCA GGGACCCGCA
ATGGGATTTT CAAGCCTTTA CGCGCTATTG ATTCTGGGGG CATCTGCAGC TTGTTTTCTG
CTCTTTTTAT ATGCAGAAAA GACAAGCACC CAGCCTATGT TTGATTTTAC CGTATTCAGG
ATACGGGACT TTTCCGGGGC CATCATCGGC TCTATTGGCA TGAACTTTTG CTTTTGGCCC
TTTATAATCT ACCTTCCTAT CTATTTTCAG GGTGTGTTGG GGTACAGTAG CCTTATGGCC
GGTACGATGC TGCTGGCCTA TACTTTGCCT ACATTGGTAG TTCCACCACT GGCCGAGCGG
CTTTTGATGA AATACCGGCC CGGTATCGTT ATCCCATTCG GGCTTTTTAT CATCGGACTG
GGATTTATAT TGATGTGGTT TGGCATCCAT GTTGCACACA TGGGGTCGTG GACCATGATA
CCAGGTATGT TGTTGTCCGG CATCGGACTA GGGTTCTCTA ATACTACGGT TACCAACACT
ACCACAGGTG CTGTGAGTTC AGACCGTGCA GGTATGGCTT CAGGGATAGA CATGAGTGCA
AGACTGATCA CCCTGGCAAT CAATATTGCT TTAATGGGAT TCTTGTTAAG TAAAGGTGTG
CTCATTCATT TAAAGATTGC ATTCGCAGGA GTTTTTGATA GTCCTCAGCT CCATTCGGTG
GCTGAAAAGA TCGCAGCAGG AAATTTTGCT GGCCTTCTGG AAAAATATCC CAGGATTGCT
ACTTTAGACC CGGCTGGCGA TATTACACAT CAGGCATTAG CTGGCGGTTT TCAACTGCTG
ACTTTGTTCG GCGGAATTGG GGTAGTTTTC CTTGCATTGG TCAGTTTCTT TATATTTAAG
CCACGATCAA TTCGATAA
 
Protein sequence
MFKDKKNTLT LAAVCLSSLM FGLEISSVPV LLPTLEKLLK GNFSDVQWIM NAYTIACTTV 
LMASGTLADR YGRKLVFTAG LLLFGITSLL CGLATAMPLL IAGRFLQGLG GGIMLICQVA
ILSHQFQDGR ERSLAFAAWG IVFGVGLGFG PLIGGMILTF LSWKWVFLVH VPLSLLTLGL
AWAGVRESRS AQIQRLDTLG MVTLSLAVFG LIYYITQGPA MGFSSLYALL ILGASAACFL
LFLYAEKTST QPMFDFTVFR IRDFSGAIIG SIGMNFCFWP FIIYLPIYFQ GVLGYSSLMA
GTMLLAYTLP TLVVPPLAER LLMKYRPGIV IPFGLFIIGL GFILMWFGIH VAHMGSWTMI
PGMLLSGIGL GFSNTTVTNT TTGAVSSDRA GMASGIDMSA RLITLAINIA LMGFLLSKGV
LIHLKIAFAG VFDSPQLHSV AEKIAAGNFA GLLEKYPRIA TLDPAGDITH QALAGGFQLL
TLFGGIGVVF LALVSFFIFK PRSIR