Gene Phep_2885 details

Gene Information

Locus tag: Phep_2885
Symbol:
ID: 8253995
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 3434667
End bp: 3436700
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 44%
IMG OID: 644936532
Product: Alpha-galactosidase
Protein accession: YP_003093145
Protein GI: 255532773
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3345] Alpha-galactosidase
TIGRFAM ID:
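
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3434667
END_BP = 3436700
GENE_LENGTH_BP = 2034
PROTEIN_LENGTH_AA = 677


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 3436700 - 3434667 + 1 gives 2034 bp, and 2034 / 3 - 1 gives 677 aa, matching the listed values.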


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.14976
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAATAAAT TCATATCCGG GTGTTTTACA CTTTTATTAA CCTTTTCATT TTTAGTAAGT 
TTTGCGCAGA AAAACCATGT AGTATGGCTG GATGACCTTC CTATCCAAAG TTTTTCAGAC
GGCATCCGTC CGGTGGAGGT AAAAGCCAAC TATGGTAAGG ATACCATGTG TGTAAAGGGT
GTTAAGTATT TAAGGGGATT GGGGGCACAA AGCATCAGTA TCCTTAAATT TGACCTTTCT
AAACAGGCCA TACGTTTTTC GGCAATGGCG GCAGTGGATG ACCATGGTAA TAAGGATATA
GCGCTGCGGT TTTATGTATT GGGCGACGGT AAAATATTGT TTGAAAGCGG GGAAAGGAGG
GTTGGAGATG AGCCGCTGAA AGTAGAGGTA GATTTGAGCG GAATTAAACA ACTTGGACTA
CTGGTTACGG ATAAGGTAGG TGGTGTTGGT AATAAAAGGA CCTATGCCAA CTGGATCAAT
GCCAAACTGG AAATGAAAGA GGGGCATTTG CCCGGATACC TGCGGTATCC AGATCAGAAA
TATATACTGA CCCCGCTACC CAAACAAACA CCTAAAATAA ATTCGGCCAA AGTATTTGGG
GCAAGTCCGG GCAATCCTGT CTTATACACA ATAGCAGCAA CGGGCAGGCG GCCTATGCAA
TTTTCGGCGC CTGGTTTACC CAAAGGATTA TCCATATCTG CATCAACCGG CATCATTACC
GGGGTAGTTA AAGAAAAAGG AAACTACAGT GTACTGCTGA AGGCTAAAAA TAACCTCGGT
GAAGCGAAGC AAAAATTAGT GATCAAAATT GGGGATACCA TTGCATTGAC ACCCCCACTG
GGTTGGAATG GATGGAATTC TTGGGAAACT AAAATTGACC GGGAAAAAGT AATGGCTTCT
GCCCAGGCCA TGGTAAATAA AGGTTTACGC GACCATGGCT GGAACTATAT CAATATTGAT
GACAGCTGGC AGGGCGTAAG AACCAGGCCA GACACCGCCT TACAACCGAA TGAGAAGTTT
CCCGACTTTA AAAGTATGGT TGATGCGATA CATGCATTGG GTTTAAAAGC TGGTTTGTAT
TCTACACCTT ATGTTTCCAG TTATGGCGGG TATGTAGGTG GCTCCTCTGA TTTTCCGGCA
GGAGGGGAAA CACATGAGCG CATTAAAGTG AACAGGCAAT CTTTTATGCA CATCGGAAAA
TACAGGTTTG AAACAATAGA CGCCAGACAA ATGGCGAGCT GGGGCTTTGA CTTTTTAAAA
TACGACTGGC GGATAGATGT AAATTCTACG GAACGTATGG CCGATGCCCT GAAAAAATCA
GACCGTGATG TGGTATTCAG TTTATCCAAC AATTCACCTT TTGAAAAAGT GAAAGACTGG
ATGCGCCTTT CACATATGTA CCGAACCGGC CCTGATATTA AAGATAGCTG GAATAGTTTG
TACACTACGG TATTCTCGAT CGATAAATGG GCAGCCTATA CTGGTCCCGG ACATTGGGCC
GATCCGGATA TGATGATTGT TGGTGATGTT GCAATTGGTC CGGTAATGCA TCCTACAAAA
TTAACAGCAG ATGAACAGTA TAGCCATGTT AGCATATTCA GTTTGCTGGC CGCACCCATG
TTGATCGGCT GCCCTATTGA GAAGCTGGAT GCATTTACAC TAAACCTGCT GACCAATGAT
GAAGTGATTG CCATTAACCA GGATCCACTT GGGAAAGCTG GCCGGCTTTT ATTGCGTGAG
GCCGGCATAG AGGTTTGGGT AAAACAACTG GAAGACGGTG CTTATGGTAT AGGTATTTTC
AACACTGCCG GGTATGGAGA AACACCCCAG TCTTATTTTC GCTGGGGCGA TGAAAAAGAA
AAACTATATG CACTGGATTT TACTAAGATA GGCCTGAAAG GAAAATGGCA GATCAGAGAT
GTGTGGCGGC AAAAATCCCT GGGGCAATAT AGTGGACCAT TCACCACTAC TGTTCCTTAT
CACGGTGTGG TGATGCTTAA AGTTTCTCCG GTTGGATTGG CTTTATTGAA ATAA
 
Protein sequence
MNKFISGCFT LLLTFSFLVS FAQKNHVVWL DDLPIQSFSD GIRPVEVKAN YGKDTMCVKG 
VKYLRGLGAQ SISILKFDLS KQAIRFSAMA AVDDHGNKDI ALRFYVLGDG KILFESGERR
VGDEPLKVEV DLSGIKQLGL LVTDKVGGVG NKRTYANWIN AKLEMKEGHL PGYLRYPDQK
YILTPLPKQT PKINSAKVFG ASPGNPVLYT IAATGRRPMQ FSAPGLPKGL SISASTGIIT
GVVKEKGNYS VLLKAKNNLG EAKQKLVIKI GDTIALTPPL GWNGWNSWET KIDREKVMAS
AQAMVNKGLR DHGWNYINID DSWQGVRTRP DTALQPNEKF PDFKSMVDAI HALGLKAGLY
STPYVSSYGG YVGGSSDFPA GGETHERIKV NRQSFMHIGK YRFETIDARQ MASWGFDFLK
YDWRIDVNST ERMADALKKS DRDVVFSLSN NSPFEKVKDW MRLSHMYRTG PDIKDSWNSL
YTTVFSIDKW AAYTGPGHWA DPDMMIVGDV AIGPVMHPTK LTADEQYSHV SIFSLLAAPM
LIGCPIEKLD AFTLNLLTND EVIAINQDPL GKAGRLLLRE AGIEVWVKQL EDGAYGIGIF
NTAGYGETPQ SYFRWGDEKE KLYALDFTKI GLKGKWQIRD VWRQKSLGQY SGPFTTTVPY
HGVVMLKVSP VGLALLK
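
The protein sequence corresponds to the gene sequence translated codon by codon. The following is a minimal sketch of that translation, assuming the standard codon assignments (which translation table 11 shares for elongation codons) and ignoring the alternative start codons that table 11 permits, since this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAATAAAT TCATATCCGG GTGTTTTACA"))  # MNKFISGCFT
# Last 9 bases of the gene sequence: two residues, then the stop codon.
print(translate("TTGAAATAA"))  # LK
```

Passing the full gene sequence to `translate` should reproduce the 677-residue protein sequence listed above, provided the assumptions stated here hold.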