Gene Phep_0794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0794 
Symbol 
ID8251883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp941510 
End bp943915 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content45% 
IMG OID644934444 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003091078 
Protein GI255530706 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTATA AATCATTTTC TACGCTGGCT TTTTTGTTTT TGGTAACAAC TGCCGGTGCG 
CAGCTCAAAA ATATCTATCA TAAGGGCTGG GTAGATTTTA ATAAAAATGG GAGGATGGAT
GTTTTTGAAG ATCCGTCAAG GCCTGTTGAT GCAAGGGTGA AAGATTTGCT GGGGCAAATG
AACCTGGACG AGAAAACCTG TCAGACAGCA ACACTTTACG GTTACGGAAG GGTTCTAAAA
GACGAAATGC CGACAGCGGA ATGGAAAACG AGTATCTGGA AAGATGGTAT CGCAAATATA
GATGAGGAAC TGAATAGCCT GCCTTATAAT AAAAAGGCCG TTACACAATA TTCTTTCCCT
TTTAGCAAAC ATGCCCATGC CATTAATACC GTGCAGAAGT GGTTTGTAGA GGAAACCCGC
CTGGGTATTC CGGTAGATTT CAGTAATGAA GGTATTCATG GTTTATGCCA CGATCGGGCT
ACCCCTTTTC CGGCCCCTGT GAATATCGGC AGTACCTGGA ACAAAAGTAT TGTGTACCAG
GCAGGAAGTA TTGTTGGCCG TGAAGCAAAG GCATTGGGCT ATACCAATGT ATATGCACCG
ATCCTTGATG TTGCCCGCGA CCAGCGCTGG GGAAGGGTAG TGGAATGTTA TGCAGAAGAC
CCTTTTTTGA TTGCAGAATT AGGTAAACAA ATGACCATGG GTATCCAGGA CCAGGGAACT
GCTGCTACCT TAAAGCATTA TGCTGTATAT AGTGTGCCTA AAGGCGGACG CGACGGACAG
GCACGTACTG ATCCGCATGT AGCGCCAAGG GAAATGCATG AAATGTTCCT ATATCCGTTC
AGAAGAGTAA TCCAGGAAGC TAAACCTATG GGCATCATGA GCAGTTATAA TGATTGGAAC
GGAGAACCTG TAACCGGAAG TTATTATTTT CTTACAGAGC TGCTGCGTAA ACAATACGGT
TTTGATGGCT ATGTGGTTTC GGACAGTGAA GCGGTAGAAT TTATTTCGGG TAAACATCAT
GTGGCAGAAG ATTACAAGCA GGCTGTTAAA CAGGCTATAG AGGCTGGCTT AAATGTGCGT
ACCCACTTTA CCAAGCCCGA AAACTTTATC CTTCCGCTAA GAGAATTGGT TAAGGAAGGT
TCGGTATCTA TGAAAACGCT TGATGAACGT GTGGCTGATG TATTGCGTGT AAAGTTCAGA
CTGGGCCTTT TTGATGACCC TTATGTAAAA GACCCGGCCG CTGCAGACAA AAAGGTGCAT
ACCAGGGCGG ATGAAGAGCT GGCGGTACAA CTGAACAGGG AATCTATGGT ATTGTTAAAA
AACGATAAAA ACCTGTTGCC ACTTGATATT GCTAAATATA AGCGCATCCT GGTTAGCGGA
CCATTGGCTA CCGAGATAAA TTACACCACC AGCAGGTATG GCCCCTCAAA TAACCCTATT
GTTTCTATCC TTGATGGAAT AAAAGCTTAT GCAGGTAAAA ATTCAACAAT AGCCTATAGT
AAAGGCTGTG AGGTGATTGA CGCCAAGTGG CCGGAAAGTG AGATTATTCC TGTCGAACTT
ACCACTGAAG AACAGCTGCA AATTGACCAG GCTGTGGCAG CTGCAAAAGC ATCAGATGTT
ATTATTGCTG TAGTAGGAGA GACCGATGAA CAGGTTGGGG AAAGTAAATC CAGAACCGGC
TTAAATTTGC CGGGGCGCCA GTTAATGCTG TTACAGGCCT TGCATGCTAC AGGCAAGCCT
GTAGTAATGG TTATGGTGAA CGGACGTCCT TTGACCATCA ACTGGGAAAA CCGCTACTTG
CCGGCCATCC TGCAGGCAGG ATTTCCCGGG CCATCAGCAG GTAAAGTAGT AGCCGAAACA
TTATTCGGTG ATAACAATCC CGGAGGTAAA CTGACAATGA CCTATCCAAA ATCTATCGGG
CAAATTGAGC TGAATTTCCC TTTCAAACCA GGATCGCAGG CTGGTCAGGG TAAAAATGAC
GATCCAAACG GGAACGGAAA AACCAGGGTG CTTGGTGCGC TGTACCCATT TGGATACGGC
TTAAGTTATA CCACTTTTGA GTTCAGTAAT TTAAATCTGG ACAAGAAAGA AATCCATAAC
CAGGCCGATG TACAGGTCAG TGTTGATGTG AAAAACACCG GTCAGCGCAA GGGTGATGAA
GTGGTACAAC TGTACCTGAA AGATGTAGTC AGCAGCGTGA CTACCTATGA ATCTGTATTG
AGGGGATTTG AACGTGTGAG TCTGGCACCC GGTGAAACCA AAACCCTTAA GTTTACCCTT
CATCCGGACG ATCTGGCCAT CCTTGATAAA AACATGAACC GGACTGTTGA ACCCGGAAAA
TTCATTGTCA TGATTGGTAA CTCTTCAGAA GATATTAAAC TGAAAAAGGA ATTTACAGTA
AAATAG
 
Protein sequence
MKYKSFSTLA FLFLVTTAGA QLKNIYHKGW VDFNKNGRMD VFEDPSRPVD ARVKDLLGQM 
NLDEKTCQTA TLYGYGRVLK DEMPTAEWKT SIWKDGIANI DEELNSLPYN KKAVTQYSFP
FSKHAHAINT VQKWFVEETR LGIPVDFSNE GIHGLCHDRA TPFPAPVNIG STWNKSIVYQ
AGSIVGREAK ALGYTNVYAP ILDVARDQRW GRVVECYAED PFLIAELGKQ MTMGIQDQGT
AATLKHYAVY SVPKGGRDGQ ARTDPHVAPR EMHEMFLYPF RRVIQEAKPM GIMSSYNDWN
GEPVTGSYYF LTELLRKQYG FDGYVVSDSE AVEFISGKHH VAEDYKQAVK QAIEAGLNVR
THFTKPENFI LPLRELVKEG SVSMKTLDER VADVLRVKFR LGLFDDPYVK DPAAADKKVH
TRADEELAVQ LNRESMVLLK NDKNLLPLDI AKYKRILVSG PLATEINYTT SRYGPSNNPI
VSILDGIKAY AGKNSTIAYS KGCEVIDAKW PESEIIPVEL TTEEQLQIDQ AVAAAKASDV
IIAVVGETDE QVGESKSRTG LNLPGRQLML LQALHATGKP VVMVMVNGRP LTINWENRYL
PAILQAGFPG PSAGKVVAET LFGDNNPGGK LTMTYPKSIG QIELNFPFKP GSQAGQGKND
DPNGNGKTRV LGALYPFGYG LSYTTFEFSN LNLDKKEIHN QADVQVSVDV KNTGQRKGDE
VVQLYLKDVV SSVTTYESVL RGFERVSLAP GETKTLKFTL HPDDLAILDK NMNRTVEPGK
FIVMIGNSSE DIKLKKEFTV K