Gene Phep_3453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3453 
Symbol 
ID8254573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4105895 
End bp4107157 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content41% 
IMG OID644937105 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003093708 
Protein GI255533336 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.403237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0945543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACAAC CTCAAAAAAA TATAGCCGAG CCTTTTAGTT CTTACCAACT GCTGGTTATT 
GCCTTGCTGG CTTTACTGCA GTTTACAATT GTACTGGACT TTATGGTACT TGCGCCACTT
GGCGACTTTT TAATGAAATC GTTATCCATA AGCCCTAAAG GTTTTGGATT GGTCGTCTCT
TCCTATGCTT TTAGTGCAGG TGCTTCAGGA ATTATGGCTG CCGGTTTTGC CGATAAATTT
GACCGTAAAA AGTTGCTGCT GTTTTTTTAC AGCGGTTTTA TTATAGGAAC CTTGTGTTGT
GCACTTGCCA CCAATTACGA GATGCTACTT GGTGCAAGGA TTGTAACCGG TTTATTTGGT
GGTGTAATCG GCGCCATCTC TATGACAATC ATTACAGATA TTTTTGCCGT TCACCAACGT
GGCAGAGTGA TGGGGGTTGT GCAGATGGGT TTCGCTGCAA GCCAGGTACT GGGTATACCC
ATTGGTTTGT ATTTTGCCAA TATATGGGGC TGGCATTCTT CATTTCTGAT GATTGTGATA
TTGGCAATAA TGATCGCAAT TGCAATTCTG ATCAAGATAA AACCAATTGA CAAGCATCTG
GCCATACAAT CAGACAAAAG CGCCTTCCTG CATTTGTGGC ATGCGGTTTC TAACCGTTCC
TATCAGACAG GATTTATTGC AACTGCATTT ATGGGTGTTG GTGGTTTTAT GTTAATGCCA
TTTGGAAGTG CTTATCTGAT CAATAACATC AACATTACTG AAGCGCAACT GCCATTGGTA
TTTATGTTTA CCGGCCTGGC TGCTGTTGTT GTAATGCCAT TAATTGGAAA ATTAAGTGAT
AAAGTAGACA AGTTTATGGT GTTTACTGGT GGGTCATTGC TTGCAGTGGT AATGATTCTG
GTATACACTA ACCTTAGTCC GGTTCCATTA TGGCAGGTTA TCGTGATCAA TATGGTCTTA
TTTATGGGGG TGATGAGCAG GATGATTCCT GCAACTACAC TTACGATGAG CATCCCTGAC
CTAAATGACA GGGGGGCTTT TATGAGTGTC AATGCTTCTA TACAACAAAT GGCCGGTGGT
ATTGCTGCGT TATGTGCTGG TTTGATCGTT ACACAGAGAA CAAAGAGTAG TCCACTGGAG
CATTATGATA CTTTAGGTAT AGTGGTATCG GCACTTATAC TTTTATGCAT ATTTTTGGTT
TACCGTGTAA GTGTAATGGT GAAAAAGAAA GACGCTGTAT TGAAAATCCC AGCTAAGCAC
TGA
 
Protein sequence
MQQPQKNIAE PFSSYQLLVI ALLALLQFTI VLDFMVLAPL GDFLMKSLSI SPKGFGLVVS 
SYAFSAGASG IMAAGFADKF DRKKLLLFFY SGFIIGTLCC ALATNYEMLL GARIVTGLFG
GVIGAISMTI ITDIFAVHQR GRVMGVVQMG FAASQVLGIP IGLYFANIWG WHSSFLMIVI
LAIMIAIAIL IKIKPIDKHL AIQSDKSAFL HLWHAVSNRS YQTGFIATAF MGVGGFMLMP
FGSAYLINNI NITEAQLPLV FMFTGLAAVV VMPLIGKLSD KVDKFMVFTG GSLLAVVMIL
VYTNLSPVPL WQVIVINMVL FMGVMSRMIP ATTLTMSIPD LNDRGAFMSV NASIQQMAGG
IAALCAGLIV TQRTKSSPLE HYDTLGIVVS ALILLCIFLV YRVSVMVKKK DAVLKIPAKH