Gene Phep_2211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2211 
Symbol 
ID8253317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2547977 
End bp2549218 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content47% 
IMG OID644935860 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003092477 
Protein GI255532105 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.895405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATT CCGGACAAAC CGTTTCCTTT TACGCATGGC TCATTGTCGC CCTGCTATGG 
ATAGTGGCCT TTTTAAATTA CCTCGACCGC ATCCTGATCA CTTCTATGCG CGATCCGATT
GTTGCCGATT TCAATTTATC TGATGCGCAA TTTGGCTTAC TTACATCGGT ATTCCTCTGG
TCTTACGGTA TACTGAGCCC TTTTGGGGGT TTTTTCGCCG ACAGGTACAG CAGAAAAAAA
GTTATCGTGT TCAGTGTAAT GGTCTGGTCG GCCGTAACCA TCTGGACAGG CTATGCCACT
TCATTTCATG AAATGCTGGC AGCCCGTTTC TTAATGGGGG TAAGTGAAGC CTGTTATATA
CCGGCAGCCC TTGCCCTGAT CACTGATTAC CATAAAGGTC GCACACGTTC ACTGGCAACC
GGATTACACA TGAGCGGCTT ATATGCAGGC CTTGCCCTGG GCGGTCTTGG CGGTTACATC
GCAGAACTAT GGGGCTGGCG TTCTGGCTTC CATATTTTTG GAGCAGTAGG GATTGTGTAT
TCTTTGATAC TTTTATACAT TTTAAAAGAC CAGAAAGCTT CCGCAGAAAC AGCAGAAACA
GCAGAAACAA GTACCCAAAC CACTGGCATT AGTCTTACCG GTGCCTTGAA AGTCTTGTTC
AGCGAAGCCT CTTTCCTCAT CCTCCTCATC TATTTTGCCG TTCTTGGTAT CGTAAACTGG
CTGGTTTACG GCTGGCTGCC AACCTTTCTC AAAGATCATT TCAACCTTAA CCTCGGCGAA
GCCGGCATTT CTGCAACGGG TTATATCCAG ATCGGTTCTT TTATAGGTGT AATTGTGGGG
GGCATACTGG CCGACAGGTG GACAAGGAAA AACAACCGCG GCCGACTCTA CATCCTCATT
ATTGGGTTTA CCTTGGGTGC ACCATTCTTA TTTCTAATGG CCTCAACCAG CATTTTTAGC
ATCGCAATCC TGGCCATGCT CATCTTCGGC CTGGCCAGGG GATTTAATGA TGCCAATATG
ATGCCCATAT TACGGCAGAT AGCCGATGGA CGGTATATTG CAACGGGCTA TGGCTTTCTT
AACTTTTTAA GCACAATTGT AGGCGGACTG ATGGTTTACG CTGGCGGCGC ATTAAAAGAT
GCCCAGGTAG ACCTTTCCAT TGTTTACCAG ATCTCAGCTG TCGTTATGCT ATTAGCCACT
TGGCTATTAT TTGCAATAAA GCTCAAAAAC AGCAATTCCT GA
 
Protein sequence
MKNSGQTVSF YAWLIVALLW IVAFLNYLDR ILITSMRDPI VADFNLSDAQ FGLLTSVFLW 
SYGILSPFGG FFADRYSRKK VIVFSVMVWS AVTIWTGYAT SFHEMLAARF LMGVSEACYI
PAALALITDY HKGRTRSLAT GLHMSGLYAG LALGGLGGYI AELWGWRSGF HIFGAVGIVY
SLILLYILKD QKASAETAET AETSTQTTGI SLTGALKVLF SEASFLILLI YFAVLGIVNW
LVYGWLPTFL KDHFNLNLGE AGISATGYIQ IGSFIGVIVG GILADRWTRK NNRGRLYILI
IGFTLGAPFL FLMASTSIFS IAILAMLIFG LARGFNDANM MPILRQIADG RYIATGYGFL
NFLSTIVGGL MVYAGGALKD AQVDLSIVYQ ISAVVMLLAT WLLFAIKLKN SNS