Gene Phep_1722 details

Gene Information

Locus tag: Phep_1722
Symbol:
ID: 8252824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2041117
End bp: 2042184
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 48%
IMG OID: 644935374
Product: glycosidase PH1107-related
Protein accession: YP_003091995
Protein GI: 255531623
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2152] Predicted glycosylase
TIGRFAM ID:
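
The coordinate and length fields above can be cross-checked against one another. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2041117
end_bp = 2042184


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1068
print(protein_length(length_bp))  # 355
```

Under these assumptions both results agree with the listed Gene Length (1068 bp) and Protein Length (355 aa).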


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000791852
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000765137
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAAAGATC TTGCCCAACG TTTTCCTGAA AATCCCCTGC TGCTACCCAA AGATCTCAGT 
CCGAGTGCCT CAGGGTTGCA GATCATTTGC CTGCTGAATC CTGGGGTATT CAGGTTTGAA
GGAAAAACAT GGTTGCTGGT TCGCGTAGCA GAAAGTGTAA AACAGGTAGA AGGTTGGGCC
TTTATCCCCT TGCTGAATGA TGAAGGCATA CTGGAAATCA TCGAGGTGCC CTTAAATGAT
CCTGATCTGA TCGCATCTGA TCCACGGGTG TTCAATTACC AGGGCCTGGA TTACCTGACC
ACCGTTTCTC ACCTCAGGTT GCTGAGCAGT GAAGATGGTG TTCACTTTCA GGAAGAAGCA
GGTTACCCGG GTATTTTCGG ACAGGGTAAG CTGGAAAGAT TTAGTATTGA AGATTGTCGT
GTCGTCCTTC TGGATGGAAA ATATTACCTC ACCTATACCG CTGTATCTGA TAACGGAGTA
GGAGTAGGCA TGCAGATCAC TACAGATTGG AAACATTTTG AGCGTAAAGG AATGATTTTA
CCTCCGCACA ATAAGGATGT GGCTCTTTTC GAGGAAAAGA TCAACGGAAA GTTTTACGCT
TTACACAGGC CCAGCAGTAA GGACATCGGG GGCAATTACA TCTGGCTGGC GGAATCTCCG
GATGGCCTGC ACTGGGGCAA TCATCAATGT ATCATCAAAT CCAGGCCGGG TATGTGGGAC
AGTGCAAGGG TCGGAGCCGG TGCTGCACCC ATCAAAACTG AACATGGCTG GCTGGAAATT
TACCATGGAG CCGATGCTGA ACACCGGTAT TGCCTTGGCG CACTGTTGCT GGATCTGCAC
GATCCTTCCA TTGTACTGGC CAGAAGTATT GAACCAATCA TGGTACCTAC AGAGAAATAT
GAACTCAGCG GGTTTTTCGG ATTTGTGGTG TTCACTAATG GTCATGTAGT GGAAGGTGAC
CGGCTAACCA TTTATTATGG TGCTGCTGAT GAGTTTGTTT GCGGCGCCCA TTTTTCGATC
AATGAAATCC TGGGGTCTCT GGGCTTTGGT GTGTTCCCGG GCGTTTGA
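
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. Only the first row of the sequence is used as sample input, and the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a sequence (whitespace is ignored)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from this record.
first_row = "ATGAAAGATC TTGCCCAACG TTTTCCTGAA AATCCCCTGC TGCTACCCAA AGATCTCAGT"

print(len(clean_sequence(first_row)))   # 60
print(f"{gc_fraction(first_row):.0%}")  # GC percentage of this row only
```

Passing the full 1068 bp sequence to `gc_fraction` instead of the first row gives a value that can be compared with the listed GC content.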
 
Protein sequence
MKDLAQRFPE NPLLLPKDLS PSASGLQIIC LLNPGVFRFE GKTWLLVRVA ESVKQVEGWA 
FIPLLNDEGI LEIIEVPLND PDLIASDPRV FNYQGLDYLT TVSHLRLLSS EDGVHFQEEA
GYPGIFGQGK LERFSIEDCR VVLLDGKYYL TYTAVSDNGV GVGMQITTDW KHFERKGMIL
PPHNKDVALF EEKINGKFYA LHRPSSKDIG GNYIWLAESP DGLHWGNHQC IIKSRPGMWD
SARVGAGAAP IKTEHGWLEI YHGADAEHRY CLGALLLDLH DPSIVLARSI EPIMVPTEKY
ELSGFFGFVV FTNGHVVEGD RLTIYYGAAD EFVCGAHFSI NEILGSLGFG VFPGV
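
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal pure-Python version, not the tool used to generate this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). Only the first row of the gene sequence is translated here, and the function name is illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from this record.
first_row = "ATGAAAGATC TTGCCCAACG TTTTCCTGAA AATCCCCTGC TGCTACCCAA AGATCTCAGT"
print(translate(first_row))  # MKDLAQRFPENPLLLPKDLS
```

The output matches the first 20 residues of the protein sequence above. Translating the full gene sequence the same way stops at the terminal TGA codon.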