Gene Phep_2237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2237 
Symbol 
ID8253343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2589552 
End bp2590601 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content48% 
IMG OID644935886 
Productglycoside hydrolase family 43 
Protein accessionYP_003092503 
Protein GI255532131 
COG category[R] General function prediction only 
COG ID[COG3940] Predicted beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.572744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.998954 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCCA CCTTAAGCGG ATATACCGCT GCAGATGAGG GTAAAAATGA ATTACCGGAT 
ACCTTAAAAG GGATGTTTAA AAACCCGATA GCAGCGGGGG CAGATCCCTG GGTAATTAAA
TCGGGAAAAT ATTATTATAC CTGCCTGAGC AACGGGAACG TGGACAGTAA AGGGATTTCG
GTTTGCAGAT CACTGAAACT TACGGAGCCC GGCAGCAAAA TAACAGTATG GACAGCCCCG
GATACCGGCT GGAATTCGAC CCAGATCTGG GCTCCTGAAC TGCATCACAT GAACAACAGG
TGGTATATTT ATTATGCCGC GGGCAGAAAA AAGGGAGCGC CGTATATCCA TCAGCGCTCT
GGTGTACTGG AATCGGTTTC TGATGATCCG CAGGGACAAT ATATAGACCG GGGATTATTA
CAGACAGGTG TAGATAAGAA TGATCCGAGT GGTACGATAT GGGCAATTGA TGTAAATGTA
GCCAGTATAA AGGGCAAACT CTATGCAGTA TGGTCAGGAT GGGAAAAAAA TATGGATACA
GATAAAACAT CGCAGCAGCT TTATATTGCA GAGATGAGCA ATCCCTGGAC GATCAGTTCA
AAACGGGTTA AACTATCGGG CCCCGACCAG CCATGGGAAC AGGGAGGCCC TTTGAACCTG
AACGAAGGCC CCGAGTTTTT ACTGCATAAG GGACAGGTTT TTATCATTTA CTCTACCCGT
GAATCCTGGA CACCTGAATA CAGACTTGGA CAGCTCCGTT TAAAGGATCC GGCCAGATCA
CTCCTGGATG CTGCCAACTG GCTGAAATCC GGTCCTGTAT TTCAGGGTAC CCAGACAGTT
CATGGCACGG GGCATGCGAG TTTTACCACT TCGCCAGACG GGAAGGAATG GTGGATGATT
TACCATACCA AGCGTAGCAC AAAGCCGGGC TGGGAACGTG ATATCATGAT GCAAAAGTTT
AAATGGGACA AAGATGGCAA CCCGGATTTT GGAAAACCGG AGCCAGCAGG CAAACTGTTG
AAAAAGCCTT CGGGAGAAGA GGGGGGATAA
 
Protein sequence
MLATLSGYTA ADEGKNELPD TLKGMFKNPI AAGADPWVIK SGKYYYTCLS NGNVDSKGIS 
VCRSLKLTEP GSKITVWTAP DTGWNSTQIW APELHHMNNR WYIYYAAGRK KGAPYIHQRS
GVLESVSDDP QGQYIDRGLL QTGVDKNDPS GTIWAIDVNV ASIKGKLYAV WSGWEKNMDT
DKTSQQLYIA EMSNPWTISS KRVKLSGPDQ PWEQGGPLNL NEGPEFLLHK GQVFIIYSTR
ESWTPEYRLG QLRLKDPARS LLDAANWLKS GPVFQGTQTV HGTGHASFTT SPDGKEWWMI
YHTKRSTKPG WERDIMMQKF KWDKDGNPDF GKPEPAGKLL KKPSGEEGG