Gene Phep_2207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2207 
Symbol 
ID8253313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2541370 
End bp2542560 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content48% 
IMG OID644935856 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003092473 
Protein GI255532101 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.78926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCCAT CTTCCCATTC AATTTATACC CTTCAGTTCG GTCTCGTTTG TTTAAGCTCT 
TTCCTTTTTT CTGCCAGCTT TAATATGCTC ATCCCCGAAC TTCCTGCTTA TTTAACCGCA
ATGGGGGGCG CAGCGTATAA AGGGCTCATT ATTGCACTGT TTACACTCAC TGCCGGAATA
TCCAGACCTT TCAGTGGTAA ACTCACGGAT ACCATTGGCC GTGTACCGGT AATGGCAGTG
GGCTCACTGG TTTGTTTTTT ATGCGGTTTT CTTTATCCGC TCCTCACTAC CATTGCCGGA
TTTCTGTTCC TCAGGTTAAT ACATGGTTTC TCTACAGGCT TTAAGCCTAC CGCTACGGCC
GCCTATGTGG CCGATCTGGT TCCTCCAGGA AAATGGGGCG AGGCGATGGG GGTACATGGT
GTATGTTTTA GCACCGGCCT GGCCATTGGC CCCGCAATTG GCAGTACCAT CACCGATCAT
TACAGCATCA ATGTGCTGTT TTACTGTTCT TCTTTATTTG CCCTGCTTTC CATTGTTATC
CTGGCCAACA TGAAAGAAAC CCTGCCAGGC AAACAAAAAT TCCGCGCAGC ACATTTAAAG
ATCAATAAAA AAGACATCAT TGAATGGCGG GTGATCCCGG CCGTGGTGAT CATCTTTTTA
AGTTACATCA GCTATGGCTC CATACTCACG GTCATATCCG ATTGGAGTGC ACACCTGGGC
ACCAGTAACA AAGGTCTGTT CTTTATGGTT TTTACGCTCA CTTCTTTATT GATCCGCTTT
GTGGCCGGTA AGGCATCCGA CAGGTATGGC CGTACACTCA TTTTAAGAAT ATCCCTCGGC
CTGCTGGCTG TATCCCTGAT GCTGATCGCC ATAGCCAGTT CTTCTTTTAC CCTGATGATG
GCATCTGCTT TATATGGGGT GGCTACAGGT ATGCTCTCGC CAACAGCAAC GGCCTGGACG
GTAGACCTGA GCGAACCCAC ACAAAGGGGT AAGGCCATGG CCACCATGTA CATTGCCCTC
GAAGCTGGTA TTGGTTTGGG TGCACTCCTT GCCGGCTGGT TGTTTATAGA CAATATCCGC
ATGATCCCTG TAACTTTTTA CTGCTGTACA GGCATTACAC TGATTGCCCT GGTTTACCTT
CAGTTTTTTT ACCGGACAAA GCAGTACATT TCTCCTAAAA ATGGCAGTTA G
 
Protein sequence
MQPSSHSIYT LQFGLVCLSS FLFSASFNML IPELPAYLTA MGGAAYKGLI IALFTLTAGI 
SRPFSGKLTD TIGRVPVMAV GSLVCFLCGF LYPLLTTIAG FLFLRLIHGF STGFKPTATA
AYVADLVPPG KWGEAMGVHG VCFSTGLAIG PAIGSTITDH YSINVLFYCS SLFALLSIVI
LANMKETLPG KQKFRAAHLK INKKDIIEWR VIPAVVIIFL SYISYGSILT VISDWSAHLG
TSNKGLFFMV FTLTSLLIRF VAGKASDRYG RTLILRISLG LLAVSLMLIA IASSSFTLMM
ASALYGVATG MLSPTATAWT VDLSEPTQRG KAMATMYIAL EAGIGLGALL AGWLFIDNIR
MIPVTFYCCT GITLIALVYL QFFYRTKQYI SPKNGS