Gene Phep_2461 details


Gene Information

Locus tag: Phep_2461
Symbol:
ID: 8253568
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2856090
End bp: 2857250
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 48%
IMG OID: 644936111
Product: major facilitator superfamily MFS_1
Protein accession: YP_003092727
Protein GI: 255532355
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
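
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length(2856090, 2857250)
print(length)                  # 1161
print(protein_length(length))  # 386
```

Both results agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields listed for Phep_2461.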


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAA GCTTACTCAC TTTAACCCTT GGCGGACTGG GGATTGGCAT CACAGAGTTT 
GTGATGATGG GCCTGTTGCC CGATATTGCA AAAGACCTCT CCATTACCAT CCCTCAGGCG
GGGCACCTCA TTTCTGCTTA TGCCCTGGGT GTGGTTATCG GTGCACCTTT ACTGGTGGCC
ATTGCAGGAA GCTATCCCCC AAAAAAGATA TTGATGGCCC TGATGATGAT GTTTGTGGCC
TTTAACGCTT TGTCGGCTTT TTCTCCTGAT TATACCACCA TGTTTATTGC CCGCTTACTT
GCGGGTTTGC CGCATGGTGC CTTTTTTGGA GTAGGGTCGG TAGTGGCCAG CCGCATTGCA
GATAAAGGAA AAGAAGCTTC GGCCGTATCG CTGATGTTTG CCGGTTTAAC CATTGCCAAT
GTGATCGGTG TGCCGCTGGG TACTTTTATT GGTCACAATT ATTCCTGGCG CTATACTTTC
GTGATCATTG TTGTTGTAGG CTTAATTACC CTTTTGAGTC TGAAATTATG GATGCCGGCA
CTGCCTGCTA CCAAGGACAG GGATCTGAAA AAAGAACTGG GTTTCTTTAA GCTGCCCGAA
GCATGGCTCA TTATCCTGAT GATTGCCATA GGTACAGGAG GACTGTTTTC CTGGTACAGC
TATATTGCCC CGCTTTTAAC CGATGTTTCG GGTTTTTCTG CCGATTCGAT TACTTACATC
CTGGTACTGG CCGGCCTGGG TATGCTGGTT GGTAACTTTA TAGGGGGCAA GCTGGCCGAC
AGGTTTTCGC CGGCAAAAGC TTCTGTTTCT TTGCTGATTG CCATGGCGGT TACCTTATTT
ATTGTACATT ATATTTCTAC CAATCAGGTA CTTTCGCTGG TCATGACCTT CATTACCGGT
GCGGTGGCTT TTGCCCTGGC GGCACCGATC CAGATGCTGA TGATCAAAAC GGCCAAAGGC
TCGGAGATGC TTGCTGCATC TGTTAGCCAG GCCAGTTTTA ATATAGGCAA CGCTTTGGGT
GCCTTTTTAG GTGGTTTGCC CCTGGCAGCT GGCTATGATT ATACTTCTCC GGTATGGGTG
GGTACCTTAA TGGCCTTAAC CGGGGCCGTA TTTGCCTGGA TGCTGATTGC CCGAAATAAA
AGAATGGCTT TGGCCGTTTA A
 
Protein sequence
MKKSLLTLTL GGLGIGITEF VMMGLLPDIA KDLSITIPQA GHLISAYALG VVIGAPLLVA 
IAGSYPPKKI LMALMMMFVA FNALSAFSPD YTTMFIARLL AGLPHGAFFG VGSVVASRIA
DKGKEASAVS LMFAGLTIAN VIGVPLGTFI GHNYSWRYTF VIIVVVGLIT LLSLKLWMPA
LPATKDRDLK KELGFFKLPE AWLIILMIAI GTGGLFSWYS YIAPLLTDVS GFSADSITYI
LVLAGLGMLV GNFIGGKLAD RFSPAKASVS LLIAMAVTLF IVHYISTNQV LSLVMTFITG
AVAFALAAPI QMLMIKTAKG SEMLAASVSQ ASFNIGNALG AFLGGLPLAA GYDYTSPVWV
GTLMALTGAV FAWMLIARNK RMALAV
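
The gene and protein sequences above are related by translation table 11. The sketch below shows one way to check that relationship and to compute GC content from the block-formatted sequence text. It is only an illustration under stated assumptions: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the codon assignments are those of the standard genetic code (which table 11 shares for internal codons), and alternative start codons such as GTG or TTG are not handled specially, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
print(translate("ATGAAGAAAA GCTTACTCAC TTTAACCCTT"))  # MKKSLLTLTL
```

The printed result matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the GC content field.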