Gene Phep_0657 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0657 
Symbol 
ID8251745 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp764852 
End bp766189 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content45% 
IMG OID644934306 
Productbeta-galactosidase 
Protein accessionYP_003090941 
Protein GI255530569 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.647462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGGG CCTCAGATTT TGGAAATGAT TTTTTATGGG GTGTGGCCAC GGCTGCGGCA 
CAGATTGAAG GTGCTGCCGA AGGGTATGGT AAAGGTTTGT CTATATGGGA TACCTTTTCC
AAACGATCCG GGAAAATCAA AAAAGGACAT CTGCCAACTC ATACCTGCGA TTTTTATCAC
AGTTATAAAG CAGATATTGC CCTTGTAAAA ATGCTGGGTT TCGGTATTTT TCGTTTCTCT
ATTTCCTGGC CTCGCATCCT GCCCAACGGC AAAGGGCGCA TCAATCCGGA AGGGATTCTG
TTTTATCATC AGGTTATAGA TGAATGCTTG CTGCAGGGTA TTGTACCTTA CATTACTTTA
TACCATTGGG ACCTGCCCGA TGCACTGGAA GATGAGGGTG GATGGACGGC TTTTTCTGTC
AACAACAGCT TTAACCACTT TGTTACCGTA TGTGCAAAAG CATATGGTGA TAAAGTGAAA
AACTGGATCG TATTGAACGA GCCTTTTGGC TTTACTTCAC TGGGTTATAT GCTTGGTGTA
CACGCTCCAG GCAAAACAGG ACTCACTAAT TTTTTCTCTG CCGTTCATCA TACGGCAATT
GCACAGGCCG ACGGAGGCCG GATATTAAGA GCTGAAGTTC AAAATGCACA TATTGGTACC
AGCTTTTCCT GCTCCGAAAT TATTCCCTAT ACACAAAGTG ATTCCGACCT GCTGGCCTCA
AAACGGGTTG ATTGCCTCAT GAACCGTTTG TTTATAGAGC CAGCATTGGG TATGGGTTAT
CCGACGGCCG ACTGGGAGCT GATGGAAAAG TTTTCTATCC AGCATTCTAC CTGGAGGCAT
ACTGAACGCC TGGCCTTCGA TTTTGATTTT ATAGGCCTGC AGAACTATTT TCCACTAACC
ATCAAATACA ATGCTTTTAT TCCTGTGGTA CAGGCCTGGG AAGTAAAGGC CAAAAGCCGT
AAAAAGCCGC ATACTGCCAT GGGCTGGGAA ATTAATGCCG GCAGTTTTTA CAACATTATC
AAACAGTTTA GCGCCTATCC CAATATCAAA AGTTTAATGA TTACCGAAAA TGGGGCGGCC
TATCACGATA AACTGATCCA TAACCAGGTG CATGACCAGG ACCGGATTGA TTATTTTCAG
CAATACCTGG GCGCGCTGTT AAAGGCAAAG CAGGAGGGGC TCAACATTAC CGGCTATATG
GCCTGGACAC TGATGGACAA TTTTGAATGG GCAGAAGGTT TTAATGCCCG CTTTGGCCTG
GTGCATACCG ATTTTAAGAC CCAACAGCGT ACTGTTAAAG ATTCCGGACT TTGGTTTAGG
GACTTCCTTC GCCGTTAA
 
Protein sequence
MIRASDFGND FLWGVATAAA QIEGAAEGYG KGLSIWDTFS KRSGKIKKGH LPTHTCDFYH 
SYKADIALVK MLGFGIFRFS ISWPRILPNG KGRINPEGIL FYHQVIDECL LQGIVPYITL
YHWDLPDALE DEGGWTAFSV NNSFNHFVTV CAKAYGDKVK NWIVLNEPFG FTSLGYMLGV
HAPGKTGLTN FFSAVHHTAI AQADGGRILR AEVQNAHIGT SFSCSEIIPY TQSDSDLLAS
KRVDCLMNRL FIEPALGMGY PTADWELMEK FSIQHSTWRH TERLAFDFDF IGLQNYFPLT
IKYNAFIPVV QAWEVKAKSR KKPHTAMGWE INAGSFYNII KQFSAYPNIK SLMITENGAA
YHDKLIHNQV HDQDRIDYFQ QYLGALLKAK QEGLNITGYM AWTLMDNFEW AEGFNARFGL
VHTDFKTQQR TVKDSGLWFR DFLRR