Gene Phep_2887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2887 
Symbol 
ID8253997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3438901 
End bp3440877 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content46% 
IMG OID644936534 
ProductAlpha-galactosidase 
Protein accessionYP_003093147 
Protein GI255532775 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00523951 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGACCA CGCTGGTTTA CGGACAAAAA AACAATACCA TCTGGATGGA TGATTTAAGT 
ATCCGTACTT TTTCGGAAGG GATTCCGGCA GTGCTGGCAA AGACTTCAGG TTCCGGTGAG
GCGATCCGTA TGAAAGGCAT CACTTACAGC AGAGGGATTG GTGTGAATGG TACCAGTGTG
CTGAGTTTTT TACTGAATGG AAACGCCTCA GCATTTTCAG CGGTGGTGGG TGTAGATGAT
ATGGGGATGA AAGGTTTGCC GTATCGGTTT TATGTGATCG GTGACCGGAA AATTCTTTTT
GAAAGCGGAG ATATGAAATG GGGAGATCAA CCCAGAATGT TAAATGTAAA TTTAACAGGA
ATTAAGCGTT TGGGCTTGCT GGTGCTTGTT GAGCAGGGTA TAACCAAAAC CTACTCCAAT
TGGGCTGATG CTAAATTTAT CATGAAAGAT GAGCAGATGC CACTCAACAT TCCCAATACA
GATGAACGGA TTATTTTAAC CCCTGTTGCC GGAACTCAAC CTAAGATCAA TTCTGCGGCT
GTGTTTGGTG CCAGACCGGG GAATCCATTT TTGTATACCA TTGCTGCAAC CGGCGAAAGG
CCCCTGGTAT TTTCAGCCAG CAATTTGCCG GACGGGCTGC AGGTTGATGC GAAGACAGGT
ATCATTACAG GTAAGGTGTT AGAGAGAGGG GTGTATACCG TAACATTGAA AGCTAAAAAT
TCATCCGGTG AGTCTGTTAA ACAGCTGAGG ATAAAAATTG GCGATACCAT TGCATTGACA
CCACCTATGG GCTGGAATGG GTGGAACTCC TGGGCAAGAG CAATTGACCA GGAAAAAGTA
ATGGCATCAG CAGATGCCAT GGTAAAAATG GGACTGGCCA ATCATGGCTG GACTTACATC
AATATCGATG ATGCCTGGCA GGGGCAAAGA GGTGGAAAAT ACAATGCCAT TCAGCCCAAT
GAAAAGTTTC CTTCTTTTAA ACAAATGACA GATTATATCC ATAGCCTGGG CTTAAAACTC
GGTGTTTATT CCACTCCCTG GATCAGCAGT TACGCGGGTT ATCCTGGTGG TTCTTCCAAC
CTGGAGCATG GATTTTTCCC TGATGCTGTG CGGGACAATA AAAGAGCTTT CCGCTATATC
GGCAAGTACA GTTTCGAAAA AGAAGATGCC ATGCAAATGG CGGAATGGGG AGTTGATTAT
TTAAAATATG ACTGGCGGAT AGAAGTACCT TCGGCAGAAC GCATGTCGGT AGCCCTGAAA
AATTCGGGCA GAGACATCTT TTACAGCATT TCCAATTCGG CTCCTTTCAG CAACGTAAAA
GACTGGGTAC GGTTAACCAA TAGTTACCGT ACAGGACCGG ATATCAGGGA TAGCTGGTTA
AGCCTCTACG TAAGTGCATT TACACTCGAT AAATGGAGCC CTTATGGCGG ACCGGGGCAT
TGGAATGATC CGGACATGAT GATATTGGGC AATGTGACTA CCGGTTCTCC TTTACATCCA
ACCAGGCTTA CTCCGGATGA ACAGTATAGC CATGTGAGTT TATTCAGTTT ACTGGCCGCC
CCGCTGCTGA TTGGCTGCCC TATAGAACAA CTGGATGCCT TTACGCTAAA CCTGCTGACC
AATGATGAAG TGATTGCTGT AAACCAGGAC GCGCTGGGCA GGCCTGCAAG GTTGGTTGGA
GAAGAAAACG GTGTGCAGAT CTGGTTGAAA CAACTGGAAA ATAAGGAGTA TGCGATTGGC
CTGTTTAACA TCGACGGATA TACCAAAACG CCGCAGTCTT ATTTTCGCTG GGGTGATGAA
AAGCCTGTAT CCTTTACCCT GGATCTGACA AAAATCGGAT TGAAGGGAAA ATATACCATA
CGTGATGTAT GGAGGCAAAA GAACCTAGGT GAATTTGAAG GAACATTTAA CACCGGCATC
AGGCACCATG GCGTGGTAAT GATCAGGTTA ACGGCCCATC AATCAACAAA ACATTAA
 
Protein sequence
MGTTLVYGQK NNTIWMDDLS IRTFSEGIPA VLAKTSGSGE AIRMKGITYS RGIGVNGTSV 
LSFLLNGNAS AFSAVVGVDD MGMKGLPYRF YVIGDRKILF ESGDMKWGDQ PRMLNVNLTG
IKRLGLLVLV EQGITKTYSN WADAKFIMKD EQMPLNIPNT DERIILTPVA GTQPKINSAA
VFGARPGNPF LYTIAATGER PLVFSASNLP DGLQVDAKTG IITGKVLERG VYTVTLKAKN
SSGESVKQLR IKIGDTIALT PPMGWNGWNS WARAIDQEKV MASADAMVKM GLANHGWTYI
NIDDAWQGQR GGKYNAIQPN EKFPSFKQMT DYIHSLGLKL GVYSTPWISS YAGYPGGSSN
LEHGFFPDAV RDNKRAFRYI GKYSFEKEDA MQMAEWGVDY LKYDWRIEVP SAERMSVALK
NSGRDIFYSI SNSAPFSNVK DWVRLTNSYR TGPDIRDSWL SLYVSAFTLD KWSPYGGPGH
WNDPDMMILG NVTTGSPLHP TRLTPDEQYS HVSLFSLLAA PLLIGCPIEQ LDAFTLNLLT
NDEVIAVNQD ALGRPARLVG EENGVQIWLK QLENKEYAIG LFNIDGYTKT PQSYFRWGDE
KPVSFTLDLT KIGLKGKYTI RDVWRQKNLG EFEGTFNTGI RHHGVVMIRL TAHQSTKH