Gene Phep_2303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2303 
Symbol 
ID8253409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2679376 
End bp2681094 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content44% 
IMG OID644935952 
Productalpha-L-rhamnosidase 
Protein accessionYP_003092569 
Protein GI255532197 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158323 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.125431 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGTCC ATAAATTAAT TCCCCGTCTA TTGGTATTGT TTCTAACCGT ACAGCTTGCA 
AATGCACAGC AGAAAAGCCC TCTTACCCGT CAATACATCA CACCTGTAAG AATTGTGTGG
AAAAACGGTG ATGTTAGAAA TAGTGAACAG CTCTTAAAGC CAGGTATTGG CCAAGGTGAT
CTCGCCAATA GGAATATCGT TATTCTTAAT AACAAAAAGG CTGGTGAAAA AGCGAGTATA
CTACTTGACT TCGGCAGAGA GCTACATGGT GGCTTAGAGA TCGTAACAGG GATGTGGGGC
GGTGGAAATA AACCGCGCAA TATCCACATT CGCTATGGCG AATCTGTCAG TGAAGCAATG
TCAAGCATAG GCGAAAAAGG CGCTACGAAC GATCATGCGA TCCGTGACTT TAGGGCACTT
GTGCCATGGC TAGGAAAGAT TCAATTTGGG GAAAGCGGTT TTCGCTTCGT TCGCATCGAT
CTTGAAGATG AGCATGCGGA ACTGCAATTG AAAGAAATTC GCGCAATTTA TACCTTCAGA
GACATACCTT ATCTCGGTTC TTTTAAAAGC AGTGATGAAC GTTTGAATAA AATATGGCAA
ACAGGGGCAT ACACAGTGCA TTTGAATATG CAAGAGTATC TATGGGACGG CATAAAAAGA
GACCGACTGG TTTGGGTTGG AGATTTACAT CCTGAAGTAT CTACCTTAAG TGCTGTCTTT
GGCTATAATG AAGTGGTGCC TAAAAGTTTA GATTTATCAA GAGACACTAC GCCCTTACCA
GGATGGATGA ATGGGATTAG TACCTACTCC ATGTGGTGGA TCATCATCCA CTATGATTGG
TATATGAAAA ATGGCAATTT GGCATATCTA AAAGAGCAGA AGACATATCT GAATGGATTA
GTGAAACAAA TTGTGGCTAG GGTGGGGAAC GACAACAAAG AGCATATGGA CGGTACGCGC
TTCCTTGACT GGCCCTCTAG CGAAAACCCC AAAGGCATAC ATGCAGGTCT ACAGGCAATG
ACCGTATGGT CACTCGCTAC AGCTGCTAAA ATAAGCGTCT TAGTGGGAGA TAAGGAAACC
GAGGCCCTCT GCAATCTTAC CGTTACACGA ATGAAGAAAT ACATTCCTGA TGTAAATAAC
TCAAAACAAG CAGCCGCATT AATGGCTATT GCAGGCATCA CAACAGCCGA AAAAGCAAAC
AAAGAAGTGC TCTCGGTAGG CGGGGCCAAA AATTTCTCTA CATTTTATGG TTATTACATG
CTGGAGACAA AAGCAAAAGC AGGAGACTAT CAGGGTGCTA TAGACGTTAT CCGGGAATAT
TGGGGCGCAA TGTTAGACCT TGGTGCTACT ACATTCTGGG AGGACTTTAA CATGGATTGG
CTCCCCAATG CTTCCCGTAT TGACGAACCC GTGCCTGCCG GTAAAATAGA CATTCATGGC
GATTACGGCG CCTACTGTTA CGTGGGTTTC CGGCATAGCC TTTGTCATGG ATGGGCTTCT
GGACCCACCT CGTGGCTTAC GGAACATGTT TTAGGTATAA AGGTGATGGC TCCGGGCAGT
AAAATCATTA AAATAACACC CCATCTGGGA GACCTTAAAT TTGCAGAGGG CACTTTTCCT
ACACCCTACG GTGTGGTAAA AGTAAAGCAT ACCAAACTTG CCAATGGAAA AATTCATTCA
GAAATCACCG GACCTAAACA AGTCAAAATA ATTCGCTGA
 
Protein sequence
MIVHKLIPRL LVLFLTVQLA NAQQKSPLTR QYITPVRIVW KNGDVRNSEQ LLKPGIGQGD 
LANRNIVILN NKKAGEKASI LLDFGRELHG GLEIVTGMWG GGNKPRNIHI RYGESVSEAM
SSIGEKGATN DHAIRDFRAL VPWLGKIQFG ESGFRFVRID LEDEHAELQL KEIRAIYTFR
DIPYLGSFKS SDERLNKIWQ TGAYTVHLNM QEYLWDGIKR DRLVWVGDLH PEVSTLSAVF
GYNEVVPKSL DLSRDTTPLP GWMNGISTYS MWWIIIHYDW YMKNGNLAYL KEQKTYLNGL
VKQIVARVGN DNKEHMDGTR FLDWPSSENP KGIHAGLQAM TVWSLATAAK ISVLVGDKET
EALCNLTVTR MKKYIPDVNN SKQAAALMAI AGITTAEKAN KEVLSVGGAK NFSTFYGYYM
LETKAKAGDY QGAIDVIREY WGAMLDLGAT TFWEDFNMDW LPNASRIDEP VPAGKIDIHG
DYGAYCYVGF RHSLCHGWAS GPTSWLTEHV LGIKVMAPGS KIIKITPHLG DLKFAEGTFP
TPYGVVKVKH TKLANGKIHS EITGPKQVKI IR