Gene Phep_1297 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1297 
Symbol 
ID8252397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1539167 
End bp1540738 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content44% 
IMG OID644934951 
Productglycoside hydrolase family 39 
Protein accessionYP_003091574 
Protein GI255531202 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3664] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGCCC TGGTTTTAAA TCTATGCACC GTGTTGCTGG GGTTTTTCCT GTGTCTTTCT 
GTTCAGCCGG TAAAGGCACA GGTTAAACGT CAGATCCATG CCGACCTGAA ATCGGTTGCC
GGAAATAAAA GCACAGTATT TAATGAATGT ATTGGTGCGG GAAGGGCAAA CGAGGGCCTG
CGGGCCGACT GGCAGCAGCA GCTAGCCATG ATCCAGAAAG ACGTACATTT TAAATACATC
CGTTTTCACG GCCTTTTGCA CGATGACATG CGGATTTATA GCCTGGATAA AACCGGTAAG
CCGGTTTATA ATTTTCAGTA TGTAGACCGG TTATACGATT TTTTACTGAG CATCAACATC
CGGCCTTTTG TCGAATTTGG CTTTATGCCG CCCGACATGG CATCGGGTAC CAAAACCATT
TTCTGGTGGA AAGCGAATGT GAGCAAACCC AAATCCTATA CACAATGGGA TACGCTGATT
ACCAAATTGG TGAAACATTG GCAGCAGCGT TATGGAGAAG AGGAAGTGAA AAAATGGTAT
TTTGAAGTAT GGAACGAGCC TGACCTCAAG GGATTTTTTG ATGGAACCCA GGCTGATTAT
TTCGAATTAT ATGCTCATAC CGTTAAAGCA GTTAAAAGTG TTTCCAGGGC TTACCGTGTG
GGTGGCCCTG CTACCTCGGC TACCAAGTGG ATCGCTGAGT TTTTATCCTA TTGTGAAAAC
AATAGATTAC CTGTCGATTT TGTGAGTACA CATGATTATG GTACAACTTC GGTACTGGAC
GAGATGGGAA CCAAAAAACA ACAACTGAAA AGCATCCGTG ACACCATTGC CATTGATGTA
AAGCGGGTAA GAGCCACCAT TAATACATCG GCCTATAAAA AAGCAGAGCT GCACTTTACA
GAATGGAACA CTTCTCCATC GTCCAGAGAC CCGATCCACG ATACTTATCA AAACGCGGCG
TATATTTTAC ACGTATTAAA AAAGGCTGCT GCCAATAGCA ATTCCATGTC GTACTGGACC
TTTACTGATA TTTTTGAGGA AGCAGGCCCG GGCCCTACAC CTTTTCATGG GGGGTTTGGA
CTGATCAATC TCCAGGATAT CAAAAAACCG GGATATCATG CTTATCATTT CATGAAGGAG
CTGGGTAATA CAGAGCTAAA AAATGAGGAT GAAAGCTCGT ATGTCTGTAA ATCTGAGCAA
GGGGTACAGG CATTGATCTG GAATTACACC CATCCCTTAA ATGGATACAG CTTTAACCAG
GATTACTTTA ATAAAGTACA GCCACCTGCA ACCAACCATG ATGTCCGGTT CAGTGTGGCC
AATCTGAGGA ATAGTACTTA CCTGCTGGAA GAGTACAGGG TAGGTTACGG GCAAAATGAT
CCCTACACTG CTTACCTGAA AATGGGAAAG CCAGAACAAT TAAGTAGGGC AGAAGTTAGC
CGGTTAAAGC AGGTGTCGTC CGGAGCTGCA TTCCGTAAAA GCACCATAAC AGTAAGCGGT
GGGAAATTTG AACAGCTTAT CCGGCTTTCT GACAATGAAA TCGTATTCCT GAAGCTGATC
AGACACCTTT AA
 
Protein sequence
MRALVLNLCT VLLGFFLCLS VQPVKAQVKR QIHADLKSVA GNKSTVFNEC IGAGRANEGL 
RADWQQQLAM IQKDVHFKYI RFHGLLHDDM RIYSLDKTGK PVYNFQYVDR LYDFLLSINI
RPFVEFGFMP PDMASGTKTI FWWKANVSKP KSYTQWDTLI TKLVKHWQQR YGEEEVKKWY
FEVWNEPDLK GFFDGTQADY FELYAHTVKA VKSVSRAYRV GGPATSATKW IAEFLSYCEN
NRLPVDFVST HDYGTTSVLD EMGTKKQQLK SIRDTIAIDV KRVRATINTS AYKKAELHFT
EWNTSPSSRD PIHDTYQNAA YILHVLKKAA ANSNSMSYWT FTDIFEEAGP GPTPFHGGFG
LINLQDIKKP GYHAYHFMKE LGNTELKNED ESSYVCKSEQ GVQALIWNYT HPLNGYSFNQ
DYFNKVQPPA TNHDVRFSVA NLRNSTYLLE EYRVGYGQND PYTAYLKMGK PEQLSRAEVS
RLKQVSSGAA FRKSTITVSG GKFEQLIRLS DNEIVFLKLI RHL