Gene Phep_1696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1696 
Symbol 
ID8252798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2005463 
End bp2006797 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content41% 
IMG OID644935348 
Productglucose/galactose transporter 
Protein accessionYP_003091969 
Protein GI255531597 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID[TIGR01272] glucose/galactose transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0309117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.239808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAA CAAATGTAGC AGCTGTCGAC GTCAGTTCTC TGAGCAAGTG GGATACGGTT 
ATCTCCATTT TTATCATTGG TTTATTGTTT TTTATTTTTG GTTTTGTAAG CTGGGTAAAT
GCCATATTGA TTCCTTATTT CAAAATTGCC TGTGAACTCA ATAACTTTCA ATCCTATCTT
GTTGCCTTTG CATTTTACAT CTCTTATTTT GTGATGTCGG TTCCCTCTTC CTATCTTTTA
AAATCAGTAG GTTTTAAGAA AGGAATGATG ATCGGTTTCT GGACGATGGC GGTTGGTGCA
TTTATTTTTG TACCTGCTGC TTTTTCCCGT ACCTACGAAG TCTTCCTGCT GGGGCTGTTT
ACATTGGGCT CAGGCCTGGC CATTTTACAA ACGGCGGCCA ATCCCTACAT TACTGTGCTC
GGACCAAAGG AAAGTGCTGC CCAGCGCATC AGTATTATGG GTATATGCAA CAAAGGTGCA
GGTATACTTG CGCCTCTGCT GTTTGCTGCA GTCATATTAA GGGCAACAGA TGGTGACCTG
TTTAAGCAAT TGCCTTTAAT GGATGCCGCT GCCAAAAGTG CGGCGCTTGA CGAGCTCATC
AGAAGGGTAA TTGTACCTTA TTCCTGTGTA GGTACAGTAT TGCTGTGCCT GGGGCTGTTT
GTCCGCTTTT CTCCACTTCC CGAGATCAAT ACCGAGCATG AGAGTGAAGA TGTAGCCCTG
GCGAATTCCG GTAAAACCAG TATTTTTCAG TTTCCTCATT TGATTCTTGG TGCTTTTGGT
ATATTTCTGC ATGTAGGTAC ACAGGTTATC GCTATAGATA CCATTATCGG TTATGCCGGA
TCGATGAATA TCCAGCTACT TGAAGCAAAA GTATTTCCTT CTTATACGCT GTTTGTAACC
ATTTGTGGTT ATCTGTTAGG CATTACAACC ATTCCAAGGT TTATTAGCCA GGTCAATGCT
TTAAGGGTAT GCACCATTCT GGGTGGTATT TTTACTTTGT TAATTATTTA TGCAAAGGGA
CAGGTTATCT TTTTGGGTCA TGCTACTGAT ATCTCTATCT GGTTTGTAGT ATTGCTTGGT
TTTGCCAATT CACTGGTTTG GGCGGGTATG TGGCCGCTTG CCCTTGATGG TTTGGGCCGT
TTTACCAAAG TGGGAGCTTC TTTAATGATC ATGGGCTTAT GTGGAAATGC AATTATGCCA
CTTTTTTATG GATATTTTGC AGATTTATTT AACCTGAGAG CTGCATACTG GGTTTTATTC
CCTTGTTACG TTTACCTTAT ATTTTATGCA ATTTACGGCC ATAAACTTCG GAGCTGGTCC
TTTAAAACTT CTTAA
 
Protein sequence
MRKTNVAAVD VSSLSKWDTV ISIFIIGLLF FIFGFVSWVN AILIPYFKIA CELNNFQSYL 
VAFAFYISYF VMSVPSSYLL KSVGFKKGMM IGFWTMAVGA FIFVPAAFSR TYEVFLLGLF
TLGSGLAILQ TAANPYITVL GPKESAAQRI SIMGICNKGA GILAPLLFAA VILRATDGDL
FKQLPLMDAA AKSAALDELI RRVIVPYSCV GTVLLCLGLF VRFSPLPEIN TEHESEDVAL
ANSGKTSIFQ FPHLILGAFG IFLHVGTQVI AIDTIIGYAG SMNIQLLEAK VFPSYTLFVT
ICGYLLGITT IPRFISQVNA LRVCTILGGI FTLLIIYAKG QVIFLGHATD ISIWFVVLLG
FANSLVWAGM WPLALDGLGR FTKVGASLMI MGLCGNAIMP LFYGYFADLF NLRAAYWVLF
PCYVYLIFYA IYGHKLRSWS FKTS