Gene Phep_3044 details


Gene Information

Locus tag: Phep_3044
Symbol:
ID: 8254156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3638901
End bp: 3640235
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 44%
IMG OID: 644936693
Product: Gluconate transporter
Protein accession: YP_003093304
Protein GI: 255532932
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG2610] H+/gluconate symporter and related permeases
TIGRFAM ID: [TIGR00791] gluconate transporter
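
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3638901
END_BP = 3640235


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1335 444
```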


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.0184086
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACCAG AAACTTCTTT TATATATCTG CTGATCTGCC TGCTTGCCGG CATATCCATC 
ATTGTATTGC TGACTACAAA ATTTAAAGTA CCGGCAGCAT TTGCGTTAAT AATAGGCTGC
TTTCTTGTTG GCTTAGGCGC ACACTTATCC TTAACAGATG TGGTAAATAT CATGAAGGAA
GGTTTTGGCA ACATTATCAA ATCACTCGGA CTGATTATAT TGTTGGGGAC AACTTTAGGG
ATATTACTGG AATATACAGG CAGCACATCA GTTATGGCCA ACTACATCTT AAAAAAACTT
GGCGAAAAAC GAACAGTATG GGGCATCAGC ATTACAGGGT TTATTGTAGG CTTACCTATT
TTTTGCGACT CTGGTTATAT CGTATTGAGT GGTTTAAATA AGACATTGGC AAAACGCGCC
GGCATATCCA TTGTAATTAT GTCAGTATCC CTGGCAACAG GCCTGTATTC TGTTCATTGT
ATGGTTCCAC CCCATCCCGG AAGTGCCGCC GCCGCAGGTA TAATTGGAGC AGATATTGGC
AAATTGATAT TGATTGGCAG CCTTGTAGCC ATACCGGCTA TGATGGTCGG AAATATATGG
GCCCGGTATG CAGGTAAAAA TCTCCCTCTA CCTGTAGTTG AAGAAGAAAC TCCAACGGAT
GTACGGTTGC ATCAGCCATC TGTTATCCAG TCGTTCCTGC CTATTGTAGT TCCTATAGTC
CTTATTGCCC TAAAATCTTT CGTTACTATA GAAGCCACCC CGGATAAAGC TTGGATGACC
GCGATCCTTT CTTTTGGCGA TCCTGTAATT GCCTTAATCA TTGGCATTTT ACTTACTTTC
TTCTGCAAAA AATCGTGGAA AAGAGCAGAA CTTGGCCACC TTTTACAGGA TTCGGCCGAA
AAAGCAGGTG GAATCCTGGT TATTATTGGT GCCGGCGGTG CATTTGGAGC CATACTTGCC
GCCATCAAGA TCGGCACCCA CTTAAGCGAA TCGCTAGCGT TGGATAGCAT GGGGCTGTTC
TTCCCATTTC TGCTCACTTT CGTCTTAAAA ACAGCCCAGG GCTCATCAAC AGTAGCCATT
ATCACTGCCG CTTCTATTGT CCTTCCCCTG CTGCAGGTAT TAGGGTTGGA TACTGAAACC
GGCAAACTGC TCTGCGTACT GGCCATGGGT GCCGGATCTA TGATGATCTC TCATGCCAAT
GATGCCTATT TCTGGGTTAT TGCAAAGTTC TCCGGCTTAG ACATGAAAAC AATGCTCAGG
GTTTATTCAG TGGCTACCGT CCTGATGGGG CTTACTTCAT TTGCCATGGT TTATATTTTA
TCGAAGTTCT TATAA
 
Protein sequence
MPPETSFIYL LICLLAGISI IVLLTTKFKV PAAFALIIGC FLVGLGAHLS LTDVVNIMKE 
GFGNIIKSLG LIILLGTTLG ILLEYTGSTS VMANYILKKL GEKRTVWGIS ITGFIVGLPI
FCDSGYIVLS GLNKTLAKRA GISIVIMSVS LATGLYSVHC MVPPHPGSAA AAGIIGADIG
KLILIGSLVA IPAMMVGNIW ARYAGKNLPL PVVEEETPTD VRLHQPSVIQ SFLPIVVPIV
LIALKSFVTI EATPDKAWMT AILSFGDPVI ALIIGILLTF FCKKSWKRAE LGHLLQDSAE
KAGGILVIIG AGGAFGAILA AIKIGTHLSE SLALDSMGLF FPFLLTFVLK TAQGSSTVAI
ITAASIVLPL LQVLGLDTET GKLLCVLAMG AGSMMISHAN DAYFWVIAKF SGLDMKTMLR
VYSVATVLMG LTSFAMVYIL SKFL
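
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons), strips the 10-base grouping spaces, and stops at the first stop codon. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 0 and stop at the first stop codon.

    Assumes the sequence starts with ATG; alternative start codons
    permitted by table 11 are not given special handling here.
    """
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCACCAG AAACTTCTTT TATATATCTG"))  # MPPETSFIYL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence shown above, and `gc_content` on the same input can be compared against the GC content reported in the gene information.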