Gene Phep_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1904 
Symbol 
ID8253008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2201622 
End bp2202914 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content44% 
IMG OID644935555 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003092174 
Protein GI255531802 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACAAA AAAACGACAA GAAGGTAATC CGCTCATGGG CATTCTTCGA CTGGGCAAAT 
TCTGCTTATA ACCTGGTCAT CACCTCAACT ATATTTCCGG CCTATTATAC CATTATTACC
ACCACGAAGG AACATGGCGA CCAGGTCATT TTTTTTGGTC ACCAGTTTAC CAATACGGCC
CTGTCCAATT ATGCGCTTTC TTTTGCCTAC CTGATCATGG TGATCCTGAT GCCCATCCTG
ACTTCGATGG CCGATTACAG GGGGAACAAG AAGGTTTTTA TGAAATTGTT TACCTATATA
GGGGGGATTG CCTGTATGGG TCTGTATTTT TTTAAGCTGG ACACACTGGA GCTGGGCATC
ATTTGTTTCG TAATTGCAGC GATTGGCTAT GTAGGTGGTG TGGTATTCAA CAATTCTTAT
CTGCCTGAGA TTGCTACACC AGACCAGCAG GATAAGGTGA GTGCGAAGGG CTATTCCTAT
GGTTATGTGG GCAGTGTGCT GTTGCAGCTG ATCTGTTTTG TTTTTGTACT GAAGCCTGAA
TTGTTTGGGA TTACGGATCT TTCTTTTCCA CCCCGGCTAT CGTTTTTGCT GGTTGGTTTG
TGGTGGATCG GTTTTGCACA GATCTCTTTC AGGAAACTGC CGCCTGGCAG CCCCAATTAT
GTGGCCATCA ATAAGCATGT GATCCATAGT GGTTTTCAGG AACTGGCTAA AGTTTGGAAA
CAGTTAAGAC ACCTGGAATT TCTGAAGAAG TTTTTACCTG CTTTCTTCTT TTACTCGATG
GGTGTGCAGA CCATTATGCT TGCTGCTGCA GGTTTCGGCG AAAAAACACT GAACCTGGGA
ACAGCCAAAC TGATTGCTGT AATTTTAATT ATACAGCTGG TGGCCATTGC CGGTGCCATG
CTGATGTCGC GTTTTGCCGA AAAGTTTGGA AATGTACGGG TGCTGATCTT TGTGGTGTTC
ATCTGGATAG GGGCCTGCGG CTGCGCTTAT TTTGTCAGCA ATGAATACCA GTTTTATGCA
TTGGCTGCAG TTGTAGGGAT GATTATGGGC GGGATCCAGT CTTTATCCAG GTCTACCTAC
TCTAAATTTT TACCGGCTAA TACGCCTGAC ACCGCCTCTT TTTTTAGTTT TTATGACGTT
ACAGAGAAAC TGGCCATTGT GATCGGACTG TTCAGCTTTG CTTTTATAGA GGAAGCAACA
GGTAGCATGC GGAATTCAAT TATTGCTTTG GCATCATTTT TCGTTATAGG TTTGGTATTT
TTGCTGCTGC TTAGAAAAGT TGAAATGAAA TGA
 
Protein sequence
MEQKNDKKVI RSWAFFDWAN SAYNLVITST IFPAYYTIIT TTKEHGDQVI FFGHQFTNTA 
LSNYALSFAY LIMVILMPIL TSMADYRGNK KVFMKLFTYI GGIACMGLYF FKLDTLELGI
ICFVIAAIGY VGGVVFNNSY LPEIATPDQQ DKVSAKGYSY GYVGSVLLQL ICFVFVLKPE
LFGITDLSFP PRLSFLLVGL WWIGFAQISF RKLPPGSPNY VAINKHVIHS GFQELAKVWK
QLRHLEFLKK FLPAFFFYSM GVQTIMLAAA GFGEKTLNLG TAKLIAVILI IQLVAIAGAM
LMSRFAEKFG NVRVLIFVVF IWIGACGCAY FVSNEYQFYA LAAVVGMIMG GIQSLSRSTY
SKFLPANTPD TASFFSFYDV TEKLAIVIGL FSFAFIEEAT GSMRNSIIAL ASFFVIGLVF
LLLLRKVEMK