Gene Phep_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1603 
Symbol 
ID8252705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1898457 
End bp1899683 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content45% 
IMG OID644935257 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003091878 
Protein GI255531506 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.501272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAA CAAATCCGGC CTCTGCCATT GAGCCTAAAC AAATTGCCCA GCAAACCGTA 
TTTCCGATAC TTTTTGCCAT TAGTTTTTCC CATTTACTAA ATGATACGAT ACAGTCGCTC
ATACCTGCCA TTTATCCTAT TGTAAAGAAT ACCTATCAGC TGAGCTTTTC GCAGATTGGC
TTAATTACCC TGATGTTCCA AATGGCCGCT TCTTTATTCC AGCCATTTGT AGGCTTATAT
ACCGATAAAA AACCACAGCC TTATTCACTG GCTATAGGAA TGGGTTTTAC GCTGGTCGGC
CTGATCACTT TATCTTTGTC CAACGGATTT TACTTCATGC TGCTTTCTGT TGCACTTATT
GGTACGGGCT CTTCCATATT CCATCCGGAA GCATCTCGTA TGGCCCATGC TGCTTCAGGC
GGAAGGAGAG GCCTGGCCCA GTCCATCTTT CAGCTGGGTG GCAATGCCGG AAGTTCTATC
GGACCTTTGC TGGCAGCCTG GATCATTGTG CCCTACGGAC AGTTCAGTGT GATCTGGTTT
TCTATCATTG CTTTACTGGC CATTATGATT TTGAGCTGGG TAGGCAAATG GTATAAGGGC
TATATGGTCA ATTTAAAGGC CAGAATGGGG GCAAAAGTAA ATGTGGTAAC CAATAATTTC
TCCAGAAAAA GGGTGGTATT TGCCGTGATC ATTTTACTGG TCCTTATCTT TTCAAAATAC
TTTTACATGG CCAGTCTGAC CAGCTACTTT ACCTTCTATC TAATAGATAA GTTTCATGTA
CCGGTGCAAA CCTCGCAGCT TTACCTGTTT GTATTTTTAT TTTCCGTTGC GGCCGGTACA
CTGATCGGTG GTCCGGTGGG CGACAGGTTC GGCCGTAAAT ATGTGATCTG GTTTTCTATT
TTAGGTACAG CACCTTTTGC CTTGTTGCTG CCCCATGCCA ATTTATTCTG GACCGGGGTA
TTGATCGTAC CGATAGGTGT GATCCTGGCC TCAGCATTCT CTGCTATTCT GGTGTATGCG
CAGGAACTGA TACCGGGTAA GGTGGGACTG GTTGCGGGAT TGTTCTTTGG TTTTGCTTTT
GGTATGGGCG GTATAGGGTC TGCTTTACTG GGTAAGCTTG CCGATAGCAC CAGCATCAAT
TACGTATTTC ATATCTGTGC ATTCTTGCCC CTGATTGGTA TCATTACCGG GTTTTTGCCC
AATATTGAGG GCAGGAAAAA AGCCTGA
 
Protein sequence
MKTTNPASAI EPKQIAQQTV FPILFAISFS HLLNDTIQSL IPAIYPIVKN TYQLSFSQIG 
LITLMFQMAA SLFQPFVGLY TDKKPQPYSL AIGMGFTLVG LITLSLSNGF YFMLLSVALI
GTGSSIFHPE ASRMAHAASG GRRGLAQSIF QLGGNAGSSI GPLLAAWIIV PYGQFSVIWF
SIIALLAIMI LSWVGKWYKG YMVNLKARMG AKVNVVTNNF SRKRVVFAVI ILLVLIFSKY
FYMASLTSYF TFYLIDKFHV PVQTSQLYLF VFLFSVAAGT LIGGPVGDRF GRKYVIWFSI
LGTAPFALLL PHANLFWTGV LIVPIGVILA SAFSAILVYA QELIPGKVGL VAGLFFGFAF
GMGGIGSALL GKLADSTSIN YVFHICAFLP LIGIITGFLP NIEGRKKA