Gene Phep_3638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3638 
Symbol 
ID8254769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4351675 
End bp4353975 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content46% 
IMG OID644937299 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003093891 
Protein GI255533519 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.456784 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAG CTAACTTATT TATCCTTACG CTCATTTTTA CTGCCCATAT AACTTTTGCC 
CAAACCAAAA AAACGGGCAA AGCTACCCAT CAGCAAATGA ATACCTTCAT CAGTAACCTG
ATGTCAAAAA TGACATTAGA CGAAAAGATC GGGCAGCTAA ATCTGCTTAC CGGTGGCGAA
GCTACTACGG GCTCGGTAGT AAGCACCGAT GTAGAAAGTA AAATCAAGAA AGGACAGGTA
GGAGGTATTT TTAGCTTAAC CACCCCTGCA CGCATCCGTA AGGCCCAGGA AATAGCCGTT
AATCAGACCC GGCTCAAAAT ACCTATCATA TTTGGTCAGG ATGTGATCCA TGGATACAAA
ACTACTTTCC CTATTCCGCT GGCCTTATCC AGTACCTGGA ACATGGAAAT GATCAAAAAA
ACAGCCCGGA TTGCCGCCAT AGAAGCTACA GCAGATGGGC TGAACTGGAC CTTCTCGCCC
ATGGTCGATA TTTCCAGGGA CCCCCGCTGG GGCCGTATTT CTGAAGGTTC AGGAGAAGAC
ACCTATCTGG GCTCAGAAAT TGCCAGGGCC ATGGTAAAAG GCTACCAGGG TGATGACCTT
GCTAAATACA ATACCATGAT GGCCTGTGTA AAACATTTTG CTTTATACGG GGCGTCAGAA
GCGGGCAGAG ACTACAATAC CACGGATATG AGTCTGGATC GCATGTATAA TGAATACTTG
CCGCCCTATA AAGCTGCGCT TGATGCGGGC GCAGGCAGTA TCATGACCTC CTTTAACGAC
ATCAATGGAG TGCCTGCCAC TGCAAACAAA TGGCTCATGA CTGATCTTTT GCGTAAAGAA
TGGGGGTTTA AAGGTCTTGT GGTTACCGAC TATACTGCGG TTAACGAGCT GATAGACCAT
GGCCTGGGCG ATCTGAAAGC CGTATCGGCC CTGTCCATTA ACGCAGGTGT AGACATGGAC
ATGGTTGGCG AAGGCTTTTT AACCACCCTG AAAAAATCTG TACAGGAAGG CAAAGTAAAA
GCACAACGCA TTGATGAAGC CTGCAGATTG GTTTTAGAGG CCAAATATAA GCTGGGCTTA
TTTGACGATC CTTTCCGTTA TTGCAATGAA GAAAGGGCTA AAACGGAAAT TTTAAAACCT
GAACACCTTG CTTTTGCCCG AGAGGTAGCT GCCGAATCTT TTGTGCTTCT GAAAAATGAA
AACCAGACCT TGCCCCTTAA AAAAACGGGT ACCATAGCTT TAATAGGCCC CCTGGCCAAT
ACAGGGGCCA ATATGCCTGG TACCTGGAGT GTAAACAGCG ACCTGGCCAA TACCGCCTCT
CTTTTAACAG GCATGAAGGC CGTTTTGGGC AAACAGGTAA AAGTGGTACA CTGCCTGGGT
TCAAACCTGG TAACAGATGA GGCCTACCAG CAGCGTGCCA CCATGTTTGG CCGCGACATT
CCAAGAGACA ACCGCCCTGA AGCAGAAGTG ATAAAAGAAG CTGTGGAACT GGCCAAAACT
GCAGATGTAG TTATTGCTGC ATTAGGTGAA AGTTCTGAAA TGAGCGGCGA AGCCTCCAGC
CGTACCAACC TGGAAATCCC GGAAGTTCAG CAGCGTTTGT TACAAGCCCT GTTAAAAACA
GGTAAACCTG TTGTACTGGT CTTGTTTACC GGCAGGCCAC TGGTACTGAA CTGGGAGCAG
CAAAACGTAC CGGCCATTTT AAATGTCTGG TTTGGTGGTA CAGAAACGGC AAAAGCAATA
ACTGATGTAC TGTTTGGCGA TGTGAACCCT TCGGGAAAAT TAACAGCCAC CTTCCCGCAA
AATGTGGGGC AGATTCCCTT ATATTATGCG CATAAAAATA CCGGCAGGCC ATTGGCAGAT
GGAAAATGGT TCAGTAAGTT CCGCTCCAAC TATCTGGACG TAAGCAACGA ACCCCTATAC
CCTTTCGGTT ACGGCCTAAG CTATACCAGT TTTGCCTACA GCAATTTAAG GTTAAGTAAA
AACAGCTTTA AACCTGGTGA ATCTATTACC GCCAGCATCG ATATTAAAAA CATCGGTTCA
AGGGAGGGCA AGGAAGTGGT ACAATTGTAC ATACGCGACC TGGTGGGCAG TTCTACCCGT
CCGGTTAAAG AACTAAAGGC TTTCCAAAAG ATCAGCCTAA AACCCGGAGA AAGCAAAACT
GTGAGCTTTA AGCTGACCGA AAATGATTTG AAGTTCTATA ATACGGCTTT AAGGTTTGTT
GCCGAACCCG GTGATTTTAA CTTGTTTATT GGCGGAAATT CCAGAGATGT GCTGGAAACG
AAGTTTAGTT TAAGAAATTA A
 
Protein sequence
MKIANLFILT LIFTAHITFA QTKKTGKATH QQMNTFISNL MSKMTLDEKI GQLNLLTGGE 
ATTGSVVSTD VESKIKKGQV GGIFSLTTPA RIRKAQEIAV NQTRLKIPII FGQDVIHGYK
TTFPIPLALS STWNMEMIKK TARIAAIEAT ADGLNWTFSP MVDISRDPRW GRISEGSGED
TYLGSEIARA MVKGYQGDDL AKYNTMMACV KHFALYGASE AGRDYNTTDM SLDRMYNEYL
PPYKAALDAG AGSIMTSFND INGVPATANK WLMTDLLRKE WGFKGLVVTD YTAVNELIDH
GLGDLKAVSA LSINAGVDMD MVGEGFLTTL KKSVQEGKVK AQRIDEACRL VLEAKYKLGL
FDDPFRYCNE ERAKTEILKP EHLAFAREVA AESFVLLKNE NQTLPLKKTG TIALIGPLAN
TGANMPGTWS VNSDLANTAS LLTGMKAVLG KQVKVVHCLG SNLVTDEAYQ QRATMFGRDI
PRDNRPEAEV IKEAVELAKT ADVVIAALGE SSEMSGEASS RTNLEIPEVQ QRLLQALLKT
GKPVVLVLFT GRPLVLNWEQ QNVPAILNVW FGGTETAKAI TDVLFGDVNP SGKLTATFPQ
NVGQIPLYYA HKNTGRPLAD GKWFSKFRSN YLDVSNEPLY PFGYGLSYTS FAYSNLRLSK
NSFKPGESIT ASIDIKNIGS REGKEVVQLY IRDLVGSSTR PVKELKAFQK ISLKPGESKT
VSFKLTENDL KFYNTALRFV AEPGDFNLFI GGNSRDVLET KFSLRN