Gene Phep_4104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_4104 
Symbol 
ID8255238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp4950713 
End bp4952929 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content46% 
IMG OID644937768 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_003094357 
Protein GI255533985 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.28211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCTAT TTCAATTTAT ACTATTCTTC TTTACAGGTC TGCAGCTTTT AAAAGTACAG 
GCCCAGGAAA AAACCGGTTA TATTCCCTTA ACCAAACAAG AAAGGCAAAA AGTAGAACTC
CTGCTGAGTA AAATGACCTT GGAAGAAAAA GCGCATCAGC TGGCCTCATT TTATCCCAAT
GCCAATAAAA GATTAAATAT CCCCCATATG CAGGCCGGTG AATGTCTGCA TGGTGTGGTT
GCTGCCGGCA CCACTTCTTT CCCTCAGGCC ATTTCCATAG CCAGTTCCTG GGATCCTTCG
CTTGTCGAAA GGGTATCTAC CGTGATTGCA AAAGAAGCCA GGGCTTTAGG CATACACCAC
TGTTATACCC CAATGCTTGG GGTTTTGCGC GATGCGCGCT GGGGCCGTTT CGAAGAAGGT
TATGGAGAAG ATGCCTACCT GGTCAGTAAA ATCGGCGTAG CCTTTATCAA TGGCCTGCAG
GGCCGTGGCA AAAACCGCTT CGATAAGGAC CATGTAGTGG CAACAGCTAA ACATTTTGTG
GCCGATAGTG AACCCCTGCT GGGTGCCAAT GGTGCTGCAG TCGAAATTTC CCTGCGTAGT
TTGCACGAAG TTCACCTTCC GCCTTTCCGG GCTGCAGTAG AAGAAGCTCA GGTTGGTTCG
GTCATGCCTG CACATCATAC CTTAAATGGG GTGCCCTGTC ACATCAATAC CTATACCCTA
AACGATGTAT TCAGAAAGGA ATACGGCTTT GATGGTCTGG TGGTTTCTGA TAACAACGAC
CTGAGGTGGG TTCAGGAGCG CTTGTTCGCC ACCGAAAGCC AGGAAGAAAC CATCAGAAAA
GCACTGGAAG CAGGTGTGCA TACCGAGCTT GCCTTTAAAC AGACCTGGGC CGATAAAAGA
ATGTATGGCC CCCCACTGGT CGCCGCGGTA AAAAACGGAA AAGTGCCGGT AAAACTGCTC
GACGACGCCG TTAGAAAAGT ACTGGAATTT AAGATTGCCC TGCACCTCGA CGAAGAAGAA
AATCCATTGG GCAAGGAAAT GACCGAATTA CAAAAAGGTA CAAAAGATGC AGATGTAAAT
GCTGATGTAT TCTTTTCGCA GATCGATGGC TCATTGTCCA GCCCCAGATC AAACTATAAA
ACCGTACTAA ATAATCCTGT ACACGATGCA CTTGCACTCG AAGCAGCCCG CAAAAGTCTC
ATCCTCCTAA AAAACAACAA CCTGCTGCCA TTTAAAAAAA GTCAGTTCAA AAAGATAGCC
GTAATTGGTC CAAATGCCGA TACCATTCGC CTGGGCACTT ATTCTACCCA GCAGCCTAAA
CACTTCATTA CTGTAAAACA AGGCATCGAA ACTGCTGTAG GTAAAAATGC ACAGGTATTG
TATGCGAAAG GGACTGATAT CCAGCATCCA AAAGATACGC AGCTTGCAGA AGCCGTTGCC
ATTGCAAAAG AAGCTGATGT ATGTATCCTG GTGCTGGGCG ATGATGATAA AACCGTAATG
GAAAATGTGG ATAGGGACGA CATTACCTTG CCGGGCGACC AGGATAAGCT GATGCAGGCC
ATTGTAGCCA CAGGCAAACC TGTAGTACTG GTATTGCTGC ATGGCCGTCC GGCCGCTATT
CAATGGGCCA AAGACCATGT TCCGGCCATA TTAGACGGAT GGTTTCTGGG GCAGGAAACA
GGTACTGCCA TTGCAGAAGC CATATTTGGC GATCTGAACC CTTCCGGAAA ATTAACTGTT
ACCTACCCAA GAAATGTAGG TCAGGTACCT GCATTTTATA ATACTTTAAT ACCAGGCAGG
CCAAGAATGA TGTGGGGAAC TACAGAAGGT GCAACCTATC CCTTTGGTTA TGGCATCAGC
TACACACAAT TTAAATATGG AGTACCAAAA CTCTCTAAAG CCAGCATGAA AGCCAGTGAA
ACTGTTTTTG CCGAAATCGA AGTAACCAAT ACCGGTAAAG TGGCTGGCGA TGAAATTGTG
CAGCTGTACC TTCGTGATGA CATCTCTTCA CTGGCAAGGC CAATTAAAGA ATTAAAAGGG
TTTAAACGCA TTAGCCTGCG TCCGGGCGAA ACCCAAAAGA TTTCCCTGCC CATTTCTTCC
CGTTCGCTTG AATTCTGGAA AGATGGCAAA TGGATTACCG AACCTGGCAG TTTCACAGTC
ATGATGGGCC CAAATTCTGA AGAACTGAAA ACCATTAAAT TAGAACTGAC CCAATAA
 
Protein sequence
MRLFQFILFF FTGLQLLKVQ AQEKTGYIPL TKQERQKVEL LLSKMTLEEK AHQLASFYPN 
ANKRLNIPHM QAGECLHGVV AAGTTSFPQA ISIASSWDPS LVERVSTVIA KEARALGIHH
CYTPMLGVLR DARWGRFEEG YGEDAYLVSK IGVAFINGLQ GRGKNRFDKD HVVATAKHFV
ADSEPLLGAN GAAVEISLRS LHEVHLPPFR AAVEEAQVGS VMPAHHTLNG VPCHINTYTL
NDVFRKEYGF DGLVVSDNND LRWVQERLFA TESQEETIRK ALEAGVHTEL AFKQTWADKR
MYGPPLVAAV KNGKVPVKLL DDAVRKVLEF KIALHLDEEE NPLGKEMTEL QKGTKDADVN
ADVFFSQIDG SLSSPRSNYK TVLNNPVHDA LALEAARKSL ILLKNNNLLP FKKSQFKKIA
VIGPNADTIR LGTYSTQQPK HFITVKQGIE TAVGKNAQVL YAKGTDIQHP KDTQLAEAVA
IAKEADVCIL VLGDDDKTVM ENVDRDDITL PGDQDKLMQA IVATGKPVVL VLLHGRPAAI
QWAKDHVPAI LDGWFLGQET GTAIAEAIFG DLNPSGKLTV TYPRNVGQVP AFYNTLIPGR
PRMMWGTTEG ATYPFGYGIS YTQFKYGVPK LSKASMKASE TVFAEIEVTN TGKVAGDEIV
QLYLRDDISS LARPIKELKG FKRISLRPGE TQKISLPISS RSLEFWKDGK WITEPGSFTV
MMGPNSEELK TIKLELTQ