Gene Phep_3480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3480 
Symbol 
ID8254600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4142180 
End bp4143385 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content41% 
IMG OID644937131 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003093734 
Protein GI255533362 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGC CCAAACAAAT TGCTGGCCAT TATAAAGAAT CATTTTCAGG TTTAAGCAGG 
GAAACCTGGA TCCTAAGCAT AGTGATGCTT ATTAACCGTA GCGGTTATAT GGCCGTTCCA
TTTATGGGTC TGTATGTGAC GCAGTCGCTG CACCGCCTGC CTTCAGATGC AGGATTGATC
ATTACGCTTT TTGGTATTGG CTCTATATTG GGCTCGGCCG TTGGCGGCAA GCTTACAGAC
GTCATTGGGT TCAGACCTGT ACAAATTATT GCGGCCATAG TAAGTGGTAT TTTCTTTTTA
TTTTTTGCCA GTGTTACCCA TTTTCAAACA CTATGTGTAC TGGCTTTGGT CATCAGCTTT
TTTTCAGAGG CCTTTCGGCC TGCTAATTTT GCTGCTATCG CAGCTTATGC AAAAAAAGGG
CTCGAAACTC GTTCCTATTC CTTAAACCGT CTGGCAACCA ATATAGGCTG GGCTTTTGGG
GTTAGTATGG GTGGTATGAT TGCTTCTTAT AACTACAGAC TCCTATTTTA TATAGATGGG
GCAGTTAGTA TTTTTGCCGG CCTGTGTATC CTCTTTTTCT TGCCCAGAAT CCGAAACTAC
AGCAAAACCA TAAAAGAAAA AGTAAAAGGT GTTGTGGTCA GAAAACCATG GCAGGATACT
GTTTTCGTTA AATTCATTCT TCTTACCACC GTTTTTATTT TAGGTTTCTT CCTGGTTTTC
CGTGTTGTTC CTGTATTTTT TAAAGAAATC TGGAAAATCG ATGAATTTAT GATCGGATTA
ATCCTTGGTC TTAATGGTGT AATCATTGCG CTATTTGAAA TGGTAATGAT CCATAAAATT
GAGCATAAAA AATCGCCCAT GTTCTTTATC GTTATTGGTG TTTTACTTAT CGCTGCTTCG
TTCCTGCTGC TGATGCTGCC TTTTGGCAAT CCGGTATTTC TGGGTGCATT ATGCCTTATT
TTATTTACAC TGGGCGAAAT GTTTACACTC CCATTTGTAA ATACATTTGT AATGAGCAGA
GCAAATGAGT TTAACAGGGG GTTATATGCT GCAGGCTACA TGTTAAGCTG GTCTGTAGCC
CAGGTAGTTG GTCCTACCGC AGGTTTTTAC ATTGCAGAGC AGTATGGTTA CAACACTTTG
TGGATTGGAT TGTCTACCTT GATGTTGCTG ACTGCTTATT TTTATAAACG CCTTAAAACA
GTCTAA
 
Protein sequence
MSLPKQIAGH YKESFSGLSR ETWILSIVML INRSGYMAVP FMGLYVTQSL HRLPSDAGLI 
ITLFGIGSIL GSAVGGKLTD VIGFRPVQII AAIVSGIFFL FFASVTHFQT LCVLALVISF
FSEAFRPANF AAIAAYAKKG LETRSYSLNR LATNIGWAFG VSMGGMIASY NYRLLFYIDG
AVSIFAGLCI LFFLPRIRNY SKTIKEKVKG VVVRKPWQDT VFVKFILLTT VFILGFFLVF
RVVPVFFKEI WKIDEFMIGL ILGLNGVIIA LFEMVMIHKI EHKKSPMFFI VIGVLLIAAS
FLLLMLPFGN PVFLGALCLI LFTLGEMFTL PFVNTFVMSR ANEFNRGLYA AGYMLSWSVA
QVVGPTAGFY IAEQYGYNTL WIGLSTLMLL TAYFYKRLKT V