Gene Phep_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0084 
Symbol 
ID8251168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp93777 
End bp95621 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content45% 
IMG OID644933733 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_003090372 
Protein GI255530000 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.190697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGAC TAAACCTTAG TTTGCTGCTA CTGCTTATGG TTCATTTTGC GCATGCGCAA 
TGGAAACCTG TTCCTGGAAA AATTTCAACA GACTGGGCAG CTAAAGTAAA CCCATCCAAC
CCTTTACCTG AATATCCGAG ACCACAACTG GTCCGTAAAA GCTGGATAAA CCTTAACGGG
TTATGGCAAT ATGCCATTTT ACCTAAAGGC AGTGATAAAA TACCGGTAAG CTATGCAGGG
CAGATCCTGG TTCCTTATGC GGTAGAATCT TCACTGAGTG GTGTGGGTAA AATGGTTGGT
GAAAACAATG TATTGTGGTA TAACCGCAGC ATAGACCTGC CAGCAAAAAT GATGGGAGGG
AAATTATTGC TTCATTTTGG AGCTGTAGAT TGGTCTTGTA GGGTGTATGT AAACGGAAAG
CTTGCAGGCG AGCATGCCGG GGGCTATGAT GCCTTTTCCT TCGACCTTAC TTCGCTGGTC
AGAAAGGGTG TTAAACAGAA TATTTCCGTA CAGGTCTGGG ATCCTACAGA TGATGGACCA
CAGCCACGGG GCAAACAGGT TAAACAACCC AAAGGTATTT GGTATACACC TGTTACAGGG
ATATGGCAAA CCGTATGGCT GGAAAATGTA CCTCAAACTT ATATTGTAGC TACCAAACAA
ACGCCTGATA TTGATAAAAA ACAATTGGCT GTTCGGGTAG AAGTTGCTGA TTTACAATCG
GGCGATCAGC TGGAAGTTAC TGCCTGGGAG GGTGCCAAGC GGGTTGCTTT ACAGGCCGGT
GATCCGCAAA GAGAATTGAT ACTGAACATT CCTGATCCCA GGTTATGGTC ACCGGAATCG
CCATTTCTAT ATGATCTGAA AGTAGCGGTT AAGAGAAAAG GGAAAACGAT TGATGAGGTT
GCGTCCTATT TTGGAATGCG TAAATCTGCT ATGGCTAAGG ATGCCGCCGG CATACAAAGG
ATGACGCTGA ACAATCAGTT TGTATTTCAA TACGGGCCTT TAGATCAGGG CTGGTGGCCG
GATGGTTTAT ACACGGCCCC AACAGATGAA GCATTAAAAT TTGATATCGA AAAAACGAAG
GCTTTGGGAT TTAATATGAT CCGGAAGCAT GTTAAAGTAG AGCCGGCCAG ATGGTATTAC
CATTGCGATA AAATGGGCAT GCTGGTATGG CAGGATATGC CAAGCGGCGA TACCGGAGGA
AATGTATGGG ATGCGAAACC TGGTTTTATC ACCGGCGGCA AGCTGGATAA GGACCGTAGC
CCGGAATCGG AAAATATTTT CAGAAAGGAG TGGAAAGCCA TTATGGATCA GTGCTATAAT
TATCCGAGCA TTGTTTCCTG GGTTCCTTTT AACGAAGCAT GGGGCCAATT TAAAACCAAA
GAAATTGTAG ACTGGACGAT GAAATATGAT CCATCGCGCC TGGTAAATGC AGCGAGTGGC
GGAAATTATT TTATGGGCGC AGGACAGGTG CTCGACCTCC ACATTTATCC GGCACCTGCG
ATGCCTGATC CTGCAATTTT TGGTGCCCGG CAAGCTTTGG TGTTGGGTGA ATTTGGTGGA
TTGGGGCTGC CGATTGAGGG GCATACCTGG CTGGACAAGG GCAATTGGGG ATACCAGAGT
TATAAAAACA AGGAGGATCT GTTTGCACAG TATACCAAGT TTATCAGCGC TATTCCTAAA
TTGATCCGTT CGGGCTTATC TGCTGCGGTT TATACCCAGA CAACGGATGT GGAAATAGAA
ACCAATGGCC TGTTTACGTA TGATAGAAAA GTGTTAAAAA TGCCTTTGGA TGGAATGTAT
CAGCTGCATC GCCAGTTATA CGATCCATCG CTTGTTAAAT GGTAA
 
Protein sequence
MTRLNLSLLL LLMVHFAHAQ WKPVPGKIST DWAAKVNPSN PLPEYPRPQL VRKSWINLNG 
LWQYAILPKG SDKIPVSYAG QILVPYAVES SLSGVGKMVG ENNVLWYNRS IDLPAKMMGG
KLLLHFGAVD WSCRVYVNGK LAGEHAGGYD AFSFDLTSLV RKGVKQNISV QVWDPTDDGP
QPRGKQVKQP KGIWYTPVTG IWQTVWLENV PQTYIVATKQ TPDIDKKQLA VRVEVADLQS
GDQLEVTAWE GAKRVALQAG DPQRELILNI PDPRLWSPES PFLYDLKVAV KRKGKTIDEV
ASYFGMRKSA MAKDAAGIQR MTLNNQFVFQ YGPLDQGWWP DGLYTAPTDE ALKFDIEKTK
ALGFNMIRKH VKVEPARWYY HCDKMGMLVW QDMPSGDTGG NVWDAKPGFI TGGKLDKDRS
PESENIFRKE WKAIMDQCYN YPSIVSWVPF NEAWGQFKTK EIVDWTMKYD PSRLVNAASG
GNYFMGAGQV LDLHIYPAPA MPDPAIFGAR QALVLGEFGG LGLPIEGHTW LDKGNWGYQS
YKNKEDLFAQ YTKFISAIPK LIRSGLSAAV YTQTTDVEIE TNGLFTYDRK VLKMPLDGMY
QLHRQLYDPS LVKW