Gene Phep_3868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3868 
Symbol 
ID8255002 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4645267 
End bp4646619 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content47% 
IMG OID644937532 
Productsugar transporter 
Protein accessionYP_003094121 
Protein GI255533749 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGA GCCAGCAACC GGTGACTTTC AAAAACAGTT ATATTCTTTG CATATCATTT 
ATCTCGGCAC TTGGCGGATA CCTGTTCGGT TTCGACTTTG CCGTTATATC GGGTGCACTT
CCTTTTTTAC GTGTAGAGTT TGCGCTTAAT GCCTGGTGGG AAGGTTTTCT TACCGGTTCA
CTTGCGTTGG GTTGTATTGT TGGCTGTCTG ATGGCAGGCA ATTTAAGCGA CCGTTATGGC
CGTAAACCGG GCCTGATGCT GGCGGCACTC ATTTTTGCCC TTTCTTCCCT GGGAATGGCC
TTTTCTTCCG GGCTCAGTAT TTTTGTAATG ATGCGCTTTG CGGCAGGAGT TGGGGTAGGT
ATGGCCTCAA TGTTAAGCCC TATGTATATT GCGGAAGTTT CTCCCGCCAG TATCCGCGGC
CGGAATGTGG CCATTAACCA GCTTACCATT GTTATTGGCA TATTGATCAC TAACCTGGTC
AATTATACAC TTTCAGACAA TGGCCCGGAA GCCTGGCGAT GGATGTTTGG CCTGGGGGCA
GTGCCTTCTC TCCTTTTTTT ACTGGGCGTG GTGTGGCTTC CTGAAAGTCC GAGGTGGTTA
ATTAAAGAAG GCCGTTTGGA AAAAGCGAAG GCAGTATTGA ACAAGATCGG CAGTTCAGCC
TATGCACAGA ACATCTATAA CGACATTGAG CTTTCCCTTA GAGGGGGAGA AAAACAATCT
TATAGAGCTG TATTGGCAAA GGGTGTGCGT CCGGCAGTAA TTGTGGGCAT CACGCTGGCT
GTATTCCAGC AATTGTGTGG CATCAATGTC GTATTTAATT ATACCTCAAC CATATTTGAG
TCGGTGGGTG CCAGTCTGGA CCGCCAGTTG TTTGAAACGG TTGCCATTGG CATTGTAAAT
CTTGTTTTTA CCCTTGTTGC CATGTGGCAG GTTGACAAGC TGGGCCGCAG GCCATTGATG
CTGATCGGTT CCCTGGGCCT GTCTGTAGTA TATATTATCC TGGCATTTTT ACTTCAAAGC
CATGCTGCAG CGGGTATCGT TTCTGTATTT GTATTGTTGG CAATAGCCAT GTATGCCACC
TCACTTGCAC CGGTTACCTG GGTGCTCATT TCAGAGATCT TTCCAAATAA AATCAGGGGT
GTAGCCTCTT CAATTGCGAT TGTATCCCTT TGGGGGGCCT ATTTTATCCT GGTATTTACA
TTCCCCATCC TTGCAGAAAA ACTGGGTACC TATGGCCCTT TTTACCTGTA TGCCGGAATT
TGCCTGCTTG GCTTCCTGTT TGTGAAATCC AAGGTGCGTG AAACCAAAGG AAGGACACTT
GAAGAACTGG AGCAGGATTT GGTCAGACAT TAA
 
Protein sequence
MSTSQQPVTF KNSYILCISF ISALGGYLFG FDFAVISGAL PFLRVEFALN AWWEGFLTGS 
LALGCIVGCL MAGNLSDRYG RKPGLMLAAL IFALSSLGMA FSSGLSIFVM MRFAAGVGVG
MASMLSPMYI AEVSPASIRG RNVAINQLTI VIGILITNLV NYTLSDNGPE AWRWMFGLGA
VPSLLFLLGV VWLPESPRWL IKEGRLEKAK AVLNKIGSSA YAQNIYNDIE LSLRGGEKQS
YRAVLAKGVR PAVIVGITLA VFQQLCGINV VFNYTSTIFE SVGASLDRQL FETVAIGIVN
LVFTLVAMWQ VDKLGRRPLM LIGSLGLSVV YIILAFLLQS HAAAGIVSVF VLLAIAMYAT
SLAPVTWVLI SEIFPNKIRG VASSIAIVSL WGAYFILVFT FPILAEKLGT YGPFYLYAGI
CLLGFLFVKS KVRETKGRTL EELEQDLVRH