Gene Phep_1604 details


Gene Information

Locus tag: Phep_1604
Symbol:
ID: 8252706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1899723
End bp: 1901051
Gene Length: 1329 bp
Protein Length: 442 aa
Translation table: 11
GC content: 48%
IMG OID: 644935258
Product: xylose isomerase
Protein accession: YP_003091879
Protein GI: 255531507
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2115] Xylose isomerase
TIGRFAM ID: [TIGR02630] xylose isomerase
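
The coordinates and lengths listed above are mutually consistent, and a short Python sketch can confirm this. It assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1899723, 1901051)
print(length_bp)                  # 1329
print(protein_length(length_bp))  # 442
```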


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAAAC TTACTGCAGA CAACGAATAT TTCAAAGGGA TCGGACAGAT CAGCTTTGAA 
GGACAGGAAA CAGACAACCC GCTGGCTTTC AGATGGTACA ATCCTGAACA GGTGGTTGCC
GGCAAAAAGA TGAAAGAGCA CCTGCGTTTT GCCGGTGCTT ACTGGCATTC TTTCTGCGGA
AATGGTACAG ATCCCTTTGG CGGTCCGACA CATATTTTTC CCTGGGACGC GAAAGCGGAT
GTACTGGATC GTGCAAAGGA CAAAATGGAT GCAGCCTTTG AATTTCTGAC CAAAATGAAC
CTGCCCTATT ACTGCTTTCA TGATGTGGAT GTGGTAGATT ATGGCAACGA CATCAAAGAA
AATGAAAGAC GGATGCAGAT CATGACCGAT TATGCAAAAG CCAAACAGGC AGAAACAGGT
GTAAAATTGC TTTGGGGTAC GGCTAATCTT TTCTCTCACC GCAGGTATAT GAACGGAGCG
GCTACCAATC CCGACTTTCA TGTGCTGAGC CATGGCGCAG CACAGGTAAA AGCAGCCCTT
GATGCCACCA TAGCCCTTAA TGGGGAAAAT TATGTATTCT GGGGTGGCCG CGAAGGTTAC
ATGAGCCTCC TGAACACCAA TATGAAACGC GAACAGGAAC ATCTGGCAAA ATTTCTGCAT
ACAGCCAAAG ATTATGCCCG TAAAAATGGT TTCAAAGGCA CCTTCTTTAT TGAGCCCAAA
CCTTGTGAAC CCACCAAGCA CCAGTACGAT TACGATGCAG CAACCGTACT TGGCTTTCTC
CGTCAGTACG ACCTGCTGGG TGATTTTAAA CTGAACCTGG AAGTTAACCA TGCTACGCTG
GCCGGACATA CCTTCCAGCA TGAGCTGCAG GTGGCTGCTG ATGCCGGAAT GCTGGGCTCT
ATTGATGCCA ACCGCGGCGA CGAACAAAAT GGCTGGGATA CAGACCAGTT TCCAAACAAC
ATCAATGAGG TTACAGAATC CATGCTGATC ATCCTGGAAG CAGGGGGCCT GCAAGGTGGG
GGTATAAATT TCGATGCCAA GATCCGCAGG AATTCAACGG ATCCGGCCGA CCTTTTCCAT
GCACATATTG GTGGAATGGA TATTTTCGCC CGGGCCCTGA TTACCGCCGA CCGCATCCTT
CAGCATTCTG AATACAAAAA AATAAGGGCA GAAAGATATG CGTCTTACGA CAGTGGAAAA
GGCAAAGCCT TTGAAGAAGG GAGCTTAAGC CTGGAAGACC TGCGCGATTA TGCAGTGGCA
CAGGGCGAAC CGCAAACCAT CAGCGGCAAA CAGGAATTCC TGGAAAACCT GATCAACAGG
TATATTTAA
 
Protein sequence
MTKLTADNEY FKGIGQISFE GQETDNPLAF RWYNPEQVVA GKKMKEHLRF AGAYWHSFCG 
NGTDPFGGPT HIFPWDAKAD VLDRAKDKMD AAFEFLTKMN LPYYCFHDVD VVDYGNDIKE
NERRMQIMTD YAKAKQAETG VKLLWGTANL FSHRRYMNGA ATNPDFHVLS HGAAQVKAAL
DATIALNGEN YVFWGGREGY MSLLNTNMKR EQEHLAKFLH TAKDYARKNG FKGTFFIEPK
PCEPTKHQYD YDAATVLGFL RQYDLLGDFK LNLEVNHATL AGHTFQHELQ VAADAGMLGS
IDANRGDEQN GWDTDQFPNN INEVTESMLI ILEAGGLQGG GINFDAKIRR NSTDPADLFH
AHIGGMDIFA RALITADRIL QHSEYKKIRA ERYASYDSGK GKAFEEGSLS LEDLRDYAVA
QGEPQTISGK QEFLENLINR YI
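
As a cross-check, the gene sequence can be translated and compared with the protein sequence listed above. The following is a minimal pure-Python sketch, not the database's own method. It assumes the codon assignments of translation table 11, which are the same as the standard code for elongation codons. The helper names `clean`, `gc_percent`, and `translate` are illustrative only. To keep the example short, only the first three blocks and the last line of the gene sequence are used as input.

```python
# Codon table built from the compact NCBI ordering (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGACAAAAC TTACTGCAGA CAACGAATAT"))  # MTKLTADNEY
print(translate("TATATTTAA"))                         # YI
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence, after applying `clean` to both, extends the same check to the whole record; `gc_percent` can be compared with the listed GC content in the same way.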