Gene Phep_3647 details

Gene Information

Locus tag: Phep_3647
Symbol:
ID: 8254778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4361201
End bp: 4362253
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 48%
IMG OID: 644937308
Product: Xylose isomerase domain protein TIM barrel
Protein accession: YP_003093900
Protein GI: 255533528
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1082] Sugar phosphate isomerases/epimerases
TIGRFAM ID:
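
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the 1053 bp CDS includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4361201
END_BP = 4362253
GENE_LENGTH_BP = 1053
PROTEIN_LENGTH_AA = 350


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both checks agree with the listed Gene Length and Protein Length.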


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACAA TTAAAGGACC GGGTGTATTT TTAGCCCAGT TTATAGGAGA CGAGGCCCCG 
TTTAATTCCT TAGATGCGAT ATGCGAATGG GCAGCAGGCC TGGGTTTTAA AGGCATACAG
ATGCCTACCC TGGATGCGAG GTTCATAGAC CTGCAAAAAG CAGCGGAAAG TAAAACTTAT
GCCGATGAAT TAAAAGGCAG GGTAGCTGGA CACGGACTGG AGATCACAGA ATTATCTACC
CATTTACAGG GACAGCTGGT AGCGGTAAAC CCGGCTTACG ACCTTGCCTT TGATGCTTTT
GCACCAGATG CTTACAAAAA CAATCCAAAA GCAAGAACAG AATGGGCGGT ACAGCAACTG
AAATATGCGG CTAAGGCCTC TCAGAACCTT GGCTTAAATG CACATGCCAC TTTTAGCGGT
TCGCTGTTAT GGCATATGTT CCATCCCTGG CCGCAACGCC CGGAAGGGTT GGTGGATGCC
GGTTTTACAG AGCTGGCAAA AAGATGGATG CCCATCCTGA ACGAATTTGA TGCCTGTGGC
GTAGATGTTT GTTATGAGAT CCATCCGGGT GAGGACCTCT TTGATGGCAT CACTTACGAG
ATGTTCCTGG AAAAGGTGAA CAACCATCCC CGCGCCTGTT TGCTTTACGA TCCTTCACAT
TTTGTATTGC AGCAGCTGGA TTACATCCAG TACATAGATT TTTACCATGA GCGCATCAAA
GCCTTCCATG TGAAGGATTC TGAATTTAAC CCCACTGGTC GGCAGGGTAC TTTTGGCGGC
TACCAAAGCT GGGCCAACCG TGCCGGAAGG TACCGTTCGC CGGGTGATGG ACAGGTTGAT
TTTAAGACCA TCTTCAGCAA ACTGGCGCAG TACGATTATA CCGGTTGGGC GGTGATGGAA
TGGGAATGCT GCATCAAGGA TGCGCAGGTA GGTGCAAAGG AAGGAGCAGA ATTTATCCGC
AAGAACATCA TTAAGGTAAC CGACAGGGCA TTTGACGATT TTGCGGCTTC GGGCGGGAAT
GATGATTTTA ACAAGAAGAT CTTAGGCTTA TAA
 
Protein sequence
MKTIKGPGVF LAQFIGDEAP FNSLDAICEW AAGLGFKGIQ MPTLDARFID LQKAAESKTY 
ADELKGRVAG HGLEITELST HLQGQLVAVN PAYDLAFDAF APDAYKNNPK ARTEWAVQQL
KYAAKASQNL GLNAHATFSG SLLWHMFHPW PQRPEGLVDA GFTELAKRWM PILNEFDACG
VDVCYEIHPG EDLFDGITYE MFLEKVNNHP RACLLYDPSH FVLQQLDYIQ YIDFYHERIK
AFHVKDSEFN PTGRQGTFGG YQSWANRAGR YRSPGDGQVD FKTIFSKLAQ YDYTGWAVME
WECCIKDAQV GAKEGAEFIR KNIIKVTDRA FDDFAASGGN DDFNKKILGL
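
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch with hypothetical helper names. It assumes GC content means the fraction of G and C bases over the total length; the record does not state how its 48% figure was computed. Only the last line of the gene sequence is used as sample input here.

```python
def parse_blocks(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence, copied from the record above.
last_line = "GATGATTTTA ACAAGAAGAT CTTAGGCTTA TAA"
tail = parse_blocks(last_line)

if __name__ == "__main__":
    print(tail)
    print(len(tail), tail.endswith("TAA"))
    print(round(gc_fraction(tail), 3))
```

Passing the full gene sequence (all lines joined) to the same helpers gives the whole-gene length and GC fraction for comparison with the fields in the Gene Information section.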