Gene Phep_2300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2300 
Symbol 
ID8253406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2675070 
End bp2676161 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content44% 
IMG OID644935949 
Productglycosyl hydrolase family 88 
Protein accessionYP_003092566 
Protein GI255532194 
COG category[R] General function prediction only 
COG ID[COG4225] Predicted unsaturated glucuronyl hydrolase involved in regulation of bacterial surface properties, and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00753245 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0687608 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTA ACTTTAAAAC ATTTGGAATG CTACTGGTCA GTAGTTTCAC ATGGTCGGTT 
GCAGTACATG CACAGCAGTT CACTAAAAAA CGAATCATTG ACAAGATGAC CAAAGTTGCT
GACTGGCAAG TGGCCCAGAG TTGGTCAACT AGTAAGCCAG GCAACCCACC AACTGGTAGC
CGCTCTTGGG AGGCTGGTGC TTTTTATCCA GGTGTAATGG ACGCCTACAG GGCTACCAAA
AATGAGAAGT ATATGGAGGC CGTAAAACTG ATGGCCGAGA GCAATCGATA TGACCGTGGA
CCTAAACTTA GGAATGCAGA CGATCAGGCG ATTTTGCAAA CTTATCTTGA AATGTATGAA
TTTACTGGAG ATCCGAAACT ATTGACGGCC ACAAAAAGAA CTTTAGACTC GATCATGCTG
TCGCCAAAGC CAGGACGCTT AGAGTTTTCT TGGTGTGACC TTTTATTTAT GGCACCACCC
ATCTGGAGTC GCTATTCGGC CATTAGCAAA GATCAAAAGT ATCTCGATTA CTTGGCGGAA
ATCTATTGGG ATGCTGCTGA CAACCTACAA AACAAAACCT ATAAACTATT TTATCGTGAC
AATAGATTTA AATCTATCGT TGGATCAACA GGTAAACCGG TATTCTGGAG TCGTGGGAAT
GGGTGGGTGG TTGCTGGTCT TGCGCGTACA CTAGAAGCGA TGCCGGCACG CTATCCAGGT
CGGAAAAAGT ATGAAGATTT ACTTATTGAG CTAGCATCTT CACTAAAATC TCTTCAGCAG
CCCGATGGAT TCTGGAAGTC GGATCTGCTA GACCCAGAGA TTTATCCAAT GGGAGAAACC
AGTGGCACAG CGTTCTTTTG TTATGGGATT GCCTGGGCGA TAAATCATAA GCTGCTTGAT
CGCAAAGAAT ACTTGCCAGT GGTACTAAAA GCATGGAATG CGCTGAATAG TGTAGTTTTA
GCCGACGGCA AGCTCGGCTT TGTGCAACCA GGCGGCGATA GACCCTATCT TTCAACCGCG
AATATGAGTA ATTGGTATGC AGCTGGTGGT TTTTTAATGG CGGGAAATCA AGTTTTAAAA
TTTGCAAGGT AA
 
Protein sequence
MKINFKTFGM LLVSSFTWSV AVHAQQFTKK RIIDKMTKVA DWQVAQSWST SKPGNPPTGS 
RSWEAGAFYP GVMDAYRATK NEKYMEAVKL MAESNRYDRG PKLRNADDQA ILQTYLEMYE
FTGDPKLLTA TKRTLDSIML SPKPGRLEFS WCDLLFMAPP IWSRYSAISK DQKYLDYLAE
IYWDAADNLQ NKTYKLFYRD NRFKSIVGST GKPVFWSRGN GWVVAGLART LEAMPARYPG
RKKYEDLLIE LASSLKSLQQ PDGFWKSDLL DPEIYPMGET SGTAFFCYGI AWAINHKLLD
RKEYLPVVLK AWNALNSVVL ADGKLGFVQP GGDRPYLSTA NMSNWYAAGG FLMAGNQVLK
FAR