Gene Phep_1044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1044 
Symbol 
ID8252138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1227793 
End bp1228710 
Gene Length918 bp 
Protein Length305 aa 
Translation table11 
GC content45% 
IMG OID644934697 
ProductXylose isomerase domain protein TIM barrel 
Protein accessionYP_003091326 
Protein GI255530954 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1082] Sugar phosphate isomerases/epimerases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACAA AAGACCGCAG AAGTTTTATT AAAGATTTAG GGATGCTGAC AGCAGGGGTA 
GGGCTTGCTT CGTTGCTGCC ATTGGAAGCT TTAAGTGCTT TTAAAAAGGA GCCTTATCTC
ATTTCCCTGG CCCAATGGTC GCTGCACAAT ACCTTGTTTG CCAAAAAACT GGACAACCTT
GATTTTCCGC TGAAAGCGAA AAGGGATTTT GATATTCACA TTGTAGAATA TGTAAGTATA
TTTTTTGATA AGAAAGAGAA AGACCCGGCT TACCTGAAAG AACTCAAAAA CCGGACTGAT
TCTGAAGGGA TCCAGAACCA CCTGATTATG GTAGACCGGG AAGGTAACCT GGGCGATACC
GACGATAAAG CAAGACTAAC TGCGGTAGAG AACCACTATA AATGGGTAGA TGCAGCCAAA
TTTCTGGGTT GTAAGACCAT CCGTGTAAAT GCAGGGGGAA AGGGAACAGC TGCGGAAGTA
AAAGCTGCGG CCATTGATGG TCTGGGCAGG TTAACCGAAT ACGGTAAAAA GAACAGGATC
AATGTTATTG TAGAGAACCA CGGTGGTTAT TCTTCCGATG GCAAGTGGCT GACGGATGTA
ATCAAAGGGG TAAACAGCTC TTATTGTGGA ACCTTGCCCG ACTTTGGGAA CTTTGCCCTG
GGGAACGGAA AGGAATATGA CCGGTACCTG GGCGTGGAAG AAATGATGCC TTTTGCCAAG
GGGGTAAGTG CTAAAACGAT GAAATTTAAT GCTGATGGTG AAGAAAGCGA CATCGATTAC
AGCCGCATGT TCAGGATCAT TAAAGCCGCA AAATGGAATG GAATAGTAGG GATTGAGTAT
TCAGGGGCAG GAGAAACGGA AGACGAAGGG ATCAGGAAAA CGAAGGCCCT GTTAGAGAAA
GTGTTTAAAC AGGGATAA
 
Protein sequence
METKDRRSFI KDLGMLTAGV GLASLLPLEA LSAFKKEPYL ISLAQWSLHN TLFAKKLDNL 
DFPLKAKRDF DIHIVEYVSI FFDKKEKDPA YLKELKNRTD SEGIQNHLIM VDREGNLGDT
DDKARLTAVE NHYKWVDAAK FLGCKTIRVN AGGKGTAAEV KAAAIDGLGR LTEYGKKNRI
NVIVENHGGY SSDGKWLTDV IKGVNSSYCG TLPDFGNFAL GNGKEYDRYL GVEEMMPFAK
GVSAKTMKFN ADGEESDIDY SRMFRIIKAA KWNGIVGIEY SGAGETEDEG IRKTKALLEK
VFKQG