Gene Phep_2105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2105 
Symbol 
ID8253210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2425499 
End bp2427151 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content47% 
IMG OID644935754 
Productalpha-glucan phosphorylase 
Protein accessionYP_003092372 
Protein GI255532000 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR02094] alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.669842 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.393403 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATCTA AAAAAGACAT TTTCGGCTTT GAGCCCAACG CGCAGTACAG TACTGCTGCT 
GCTTATTTTT CTATGGAATT TGCCGTAGAT CAGGCGCTAA AAATTTACAG TGGCGGGCTC
GGTTTTCTTG CCGGCTCGCA CTTAAGGAGT GCTTATCAGT TAAAACAGAA CCTGCTAGGT
ATTGGCATTT TATGGAAATA TGGTTATTAC GACCAGACCA GGGACAAAGA CCAGCTCATG
AAACCTGCCT ATACCGAAAA GCAATATGCA TATCTGCAGG ATACAGGCAT AATTTTTACC
GTTCCGGTAC ACGATTCGCC TGTGCATGTA AGGGCCTATC TGCTAAAACC GGAAACCTTT
GGTACAGCAC CACTTTTTTT GCTCAGTACC GATGTTCCTG AAAATGACTA CCTGTCGCGT
ACCATTACCC ATCGTTTGTA CGATCCGCAT GAGACCACAA GAATTGCCCA GTCTATCATT
CTGGGTATTG GCGGGGCCAT GTTACTGGAT ATCTTAAACA TCACACCCGA TGTATACCAC
ATGAACGAAG GGCATTCCGT ATCACTTAAT TTTTACCTCT ACGCTAAATA TAAAAGCCTG
GATGAAGTAA AAAAACGGGT GGTATTTACT ACCCATACCC CCGAAATGGC CGGAAATGAA
GAACATAGAT ACAGTTTGCT CAAGGAAATG TCTTTCTTTT ACCATTTACA GGAGCATGAG
GTGAAATACC TGCTGGGGAT GGATGGCGAC CAGTTTAGCT ATACCCTTGC AGCGCTTAAA
TTTGCCAGAA AGGCCAATGG TGTTTCTGAA CTGCATGGTA AGGTGGCCAG GGATATGTGG
GGCCATAACC CCGGTATATG CGAAATCACT TCCATCACCA ATGCCCAGAA CCTAACCTAC
TGGCAGGATC CGCTGATGGG AAAATCAATA GCCGGTAATG ATGATCATGG TATTGTTCAA
CGAAAAAAAG AGCTGAAGGC CGATCTGTTT AAAGTAGTGG CCGATCAGTG TGGCAAACTG
TTTGATCCGG AGGTGATCAC TATAGTATGG GCCAGACGTT TTGCCGGTTA TAAGCGGGCC
GACCTGGTGA TGCAGGACTG GAACCGCTTT TTAACCCTGC TGAGCAATGG CGCATTTCCG
GTACAGCTGA TCTGGGCCGG GAAACCTTAT CCTGAAGATT TTGGTGCCAT TGGCTTGTTT
AACCAGATCA TTTCGAGGGC ATTGCCTTTA AAAAACTGTG CTGTACTTAC AGGCTATGAG
CTGGAATTGT CGGCCCTTCT TAAAAAGGGA TCTGATGTTT GGTTAAACAA CCCCAGGATG
TACAGGGAAG CCTCCGGCAC CAGTGGCATG ACTGCCGCAA TGAACGGCAG CATCAACCTT
TCCCTGCCCG ATGGCTGGGT ACCTGAATTT GCCCGGGACA GAGAAAACTG TTTCCTGATC
CAGCCCGCAC CGGATCATTC TCCGGAACAG GATCAGGACC GGCTGGAGAA CATCAGCCTG
ATGGATACAC TGGAACAGGT TGTACTGCCA ACCTATTATA ACGACCATAA CAAATGGTTA
GGCATGGTAA AAAAGGCGGC AGCTGATGTG GTCCCTGCTT TTGAATCGGG CAGAATGGCG
GCAGAATATT ACACAAAAAT GTATAAGGCT TAA
 
Protein sequence
MLSKKDIFGF EPNAQYSTAA AYFSMEFAVD QALKIYSGGL GFLAGSHLRS AYQLKQNLLG 
IGILWKYGYY DQTRDKDQLM KPAYTEKQYA YLQDTGIIFT VPVHDSPVHV RAYLLKPETF
GTAPLFLLST DVPENDYLSR TITHRLYDPH ETTRIAQSII LGIGGAMLLD ILNITPDVYH
MNEGHSVSLN FYLYAKYKSL DEVKKRVVFT THTPEMAGNE EHRYSLLKEM SFFYHLQEHE
VKYLLGMDGD QFSYTLAALK FARKANGVSE LHGKVARDMW GHNPGICEIT SITNAQNLTY
WQDPLMGKSI AGNDDHGIVQ RKKELKADLF KVVADQCGKL FDPEVITIVW ARRFAGYKRA
DLVMQDWNRF LTLLSNGAFP VQLIWAGKPY PEDFGAIGLF NQIISRALPL KNCAVLTGYE
LELSALLKKG SDVWLNNPRM YREASGTSGM TAAMNGSINL SLPDGWVPEF ARDRENCFLI
QPAPDHSPEQ DQDRLENISL MDTLEQVVLP TYYNDHNKWL GMVKKAAADV VPAFESGRMA
AEYYTKMYKA