Gene Phep_0959 details


Gene Information

Locus tag: Phep_0959
Symbol:
ID: 8252053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1121657
End bp: 1123414
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 44%
IMG OID: 644934614
Product: alpha-L-rhamnosidase
Protein accession: YP_003091243
Protein GI: 255530871
COG category:
COG ID:
TIGRFAM ID:
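
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon (the listed gene sequence ends in TAA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1121657
end_bp = 1123414
gene_length_bp = 1758
protein_length_aa = 585


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)  # prints: True True
```

Under these assumptions, the span 1121657 to 1123414 covers 1758 bp, and 1758 bp corresponds to 586 codons, of which 585 encode amino acids.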


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.8736
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAT ACCTTGTTTT TCTTTCTTTG TTAACCTGGG CGTTTACCAC TGTGACGAAT 
GCTCAGTTGC CGCCAGTTTT TAAACAGCCA ATTGCCACCG GTTTAAAAAA AGATCCTAGA
GTTCGCTATT ATTTACCTCC GGAACGGATG ATATGGCGGT CTGACGCGAC TGGAAAATAC
ATCCAGCATG CAGATAGGTT GCTGAAAGTT GGCAATGGCC AGGCCGAGCT GGTCAATAAA
GACCTAACGG TGCTAAAAAA TGATAAAAGC TCAAAAACAG GTTTCTTAAT TGATTTTGGC
AAAGAAATTT TTGGCGGTCT GCAAATTACT ACGGGTTTAA TGAGAACGAA AACACCGGTG
AAAGTAAGGG TTCGTTTTGG AGAATCCGTA ACTGAGGCCA TGTCTGATGT AGGTGGTACA
GACGGTGCAA CTAATGATCA TGCCATGCGC GACTTTATTA TAGAGTTGCC ATGGCTGGGT
GGTCTGGAAG TAGGTAACAC AGGCTTTAGA TTTGTTAGAA TTGACCTGCT GGAGGCTGAT
ACTGAACTTT TGTTAAAAGA AGTGAGTGCG ATATTCATGT ACAGGGATAT TCCTTACCTG
GGGTCTTTTC ATTCTGATGA TGAACGTTTA AACCAAATTT GGGCAACCGG GGCATATACT
GTTCATTTAA ATATGCAGCA ATATTTATGG GATGGCATTA AACGTGACAA ACTGGTATGG
ATTGGGGATA TGCACCCTGA GATGATGGTG ATCAACAGTG TATTCGGTTA TAATGAAGTT
GTTCCAATGA GTCTGGATCT GGCTAAAGCC GCTACACCAC TGCCGGCATG GATGAATGGA
ATCAGTTCTT ATTCTATGTG GTGGGTCCTT ATTCAGCGCG ACTGGTATCT TCATCAGGGG
GATATGAAAT ACCTGCAGTT GCAAAGGCAG TACCTGATTG GGCTTTTGAA GCAGTTGATG
ACTAAAATAA AAGACGGGAA GGAAGCCCTG GACGGCAATC GTTTCCTGGA TTGGCCTTCC
TCTGAGAATA AGCCGGCTAT ACATGCCGGG CTGCAGGCCA TGCTGGTAAT GACCTTAACA
GCGGGATCGG AGCTGTGCCA TATTTTAAAG GAAGCGGAAA CTGCCCGAGC CTGTGATGAA
GCTGTTGCAA TTTTAAAGAA AAATGTCCCT GATATAGCTG AAAGCAAACA AGCAGCTGCA
TTGCTTGCCT TAGCCGGGCT TTTGCCGGCT GAACAGGCTA ACACTATCCT GTCAAAAGAC
AGCACCAGGG GATTCTCTAC TTTTTATGGC TACTATATGC TCCAGGCTAA AGCAATGGCT
GGTGACTATC AGGGTGCTAT CAATAATATC AGGGACTATT GGGGCGGGAT GCTGGATTTA
GGTGCCACTA CTTTCTGGGA GGATTTTGAT CTTTCCTGGA AAGAAAATGC CGGAAGGATC
GATGAAATTG TTCCTAAAGA TAAAGTTGAT GTCCATGCAA CATACGGGGC TTATTGCTAC
AAGAACCTCA GGCACAGCTT GGCGCATGGC TGGGCTGCAG GACCCACATC CTGGCTTACT
ACACATGTAC TGGGCATAAA AGTTATGGCC CCGGGTTGTA AAGTTGTTAA GATTGAGCCC
CATCTTGGCG ATCTGAAATC AGTAAGCGGC AGCTTTCCTA CGCCCTTTGG ATTAATAAAA
GTAAATCATT TAAAAATGCC TGACGGGAAA ATCAAAACGA CAGTAGACGC TCCAAAGCAG
GTTAAAATTA TTAAATAA
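
The GC content field of this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it assumes GC content is the fraction of G and C bases over all bases, and it strips the spaces and line breaks used in the 10-base block layout above. The function name gc_fraction is hypothetical.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence listed above.
example = "ATGAAAAGAT ACCTTGTTTT"
print(gc_fraction(example))  # prints: 0.25
```

Passing the full gene sequence to the same function gives the gene-wide fraction, which can be compared with the reported GC content after rounding to a whole percent.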
 
Protein sequence
MKRYLVFLSL LTWAFTTVTN AQLPPVFKQP IATGLKKDPR VRYYLPPERM IWRSDATGKY 
IQHADRLLKV GNGQAELVNK DLTVLKNDKS SKTGFLIDFG KEIFGGLQIT TGLMRTKTPV
KVRVRFGESV TEAMSDVGGT DGATNDHAMR DFIIELPWLG GLEVGNTGFR FVRIDLLEAD
TELLLKEVSA IFMYRDIPYL GSFHSDDERL NQIWATGAYT VHLNMQQYLW DGIKRDKLVW
IGDMHPEMMV INSVFGYNEV VPMSLDLAKA ATPLPAWMNG ISSYSMWWVL IQRDWYLHQG
DMKYLQLQRQ YLIGLLKQLM TKIKDGKEAL DGNRFLDWPS SENKPAIHAG LQAMLVMTLT
AGSELCHILK EAETARACDE AVAILKKNVP DIAESKQAAA LLALAGLLPA EQANTILSKD
STRGFSTFYG YYMLQAKAMA GDYQGAINNI RDYWGGMLDL GATTFWEDFD LSWKENAGRI
DEIVPKDKVD VHATYGAYCY KNLRHSLAHG WAAGPTSWLT THVLGIKVMA PGCKVVKIEP
HLGDLKSVSG SFPTPFGLIK VNHLKMPDGK IKTTVDAPKQ VKIIK