Gene Phep_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1023 
Symbol 
ID8252117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1199203 
End bp1200498 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content45% 
IMG OID644934677 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003091306 
Protein GI255530934 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0652623 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACGA CTAAAATTGG TAATTATCGC TGGATTGTTT GCGGACTGCT CTTCTTTGCA 
ACAACCATTA ATTATATAGA CCGGCAGGTG ATCGGGTTGT TAAAACCCAC GCTGGAAAAA
GAGTTTCACT GGACAGAGGT CGATTACGGC TATATCGTGA TGGCCTTTGC CGGTATGTAT
GCACTGGGCT ATGTGGTTTT TGGTAGTTTT ATTGATAAAG TTGGTACCAA GATCGGTTAT
AGCATATCTG TTATAGTGTG GAGCATCGCT GCCATGCTGC ATGCTGTAGT TAAAGGCACA
GTAGGTTTTG GTATGGCCAG GGGATTACTG GGGCTATCAG AGGCCGGTAA TTTCCCTGCG
GGTGTTAAAG CTGTGGCAGA ATGGTTCCCT AAAAAAGAAC GGGCCCTGGC TACAGGAATA
TTTAATTCTG GTACCAGCAT CGGTGCAGTG GTGGCCCCAA TTCTGGTACC CTGGATTTTG
GGTATCTACG GATGGCAGGA AGCCTTCTGG ATTACGGGCG CGCTGGGCTT TATCTGGCTC
ATATTCTGGT GGATCTTTTA TGAAATCCCA TCCAAACAAA AGCGGCTTAA AAAGGCCGAG
TATGATTTTA TACACAGCGA TCCAGATGAT GAGCTGCAGG AAAAGCCGGT AAAATTGAAA
TGGATCCAAC TGCTTGGTTT ACGCCAAACC TGGGTATTCA TTGTAGGTAA GGTACTTACC
GATCCGATCT GGTGGTTTTT CCTGTTCTGG TTACCGGCCT ATTTTGCAGA TACTTTTGCC
TTAGACCTTA AAAAACCCAG CCTGCACCTT GCCGTAGTTT ACGCAGCCAC AACGTTTGGC
AGCATTGGCG GGGGCTACCT GTCGTCTTAT TTTATCAAAC GCGGATGGCC GGTACTTAAA
GCCAGAAAAA CCACCCTGCT CATCGTGGCC ATTGCAGTGG TGCCTATATT TTTTGCCCAG
TTTGCTCCCA ATATCTGGGT AGCGGTGGCC ATCATCAGCA TTGCAACCGC TGCGCATCAG
GCATGGAGTG CCAATATTTT TACCATCGTT TCAGATATTG TGCCTAAACA AGCCGTAAGT
TCTGTGGTAG GTATCGGTGG TATGTCGGGT TCCATTGCTT CAACTTTATT TCCGCTGCTG
GTGGGTTCAC TGCTTGCCTA TTATAAAAAT ATTGGTAACA TTGGTGCTGC TTACAATATT
TTGTTCATCA TCTGTGGCTG TGCTTATTTC CTGGCATGGT TCATTATACA AATGCTTACC
AAAAAAATGA AACCTGTTGA TTTTAGTAAC CTATAA
 
Protein sequence
METTKIGNYR WIVCGLLFFA TTINYIDRQV IGLLKPTLEK EFHWTEVDYG YIVMAFAGMY 
ALGYVVFGSF IDKVGTKIGY SISVIVWSIA AMLHAVVKGT VGFGMARGLL GLSEAGNFPA
GVKAVAEWFP KKERALATGI FNSGTSIGAV VAPILVPWIL GIYGWQEAFW ITGALGFIWL
IFWWIFYEIP SKQKRLKKAE YDFIHSDPDD ELQEKPVKLK WIQLLGLRQT WVFIVGKVLT
DPIWWFFLFW LPAYFADTFA LDLKKPSLHL AVVYAATTFG SIGGGYLSSY FIKRGWPVLK
ARKTTLLIVA IAVVPIFFAQ FAPNIWVAVA IISIATAAHQ AWSANIFTIV SDIVPKQAVS
SVVGIGGMSG SIASTLFPLL VGSLLAYYKN IGNIGAAYNI LFIICGCAYF LAWFIIQMLT
KKMKPVDFSN L