Gene Haur_1247 details


Gene Information

Locus tag: Haur_1247
Symbol:
ID: 5733125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1455013
End bp: 1456161
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 50%
IMG OID: 641278387
Product: xylose isomerase
Protein accession: YP_001544023
Protein GI: 159897776
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2115] Xylose isomerase
TIGRFAM ID: [TIGR02631] xylose isomerase, Arthrobacter type
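
The gene and protein lengths listed above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1455013
end_bp = 1456161

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A coding sequence should divide evenly into codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # 1149 0 382
```

Under these assumptions the computed values agree with the recorded gene length of 1149 bp and protein length of 382 aa.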


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACAGC ATAAATTTAG CTTTGGATTA TGGACAGTTG GCAATGTTGG GCGTGACCCA 
TTTGGTGAAC CAGTGCGCAA AACCCTCTCG CCAGTTGAGA TTGTGCATTT GTTGGCTGAA
GTTGGAGCAT GGGGCGTAAA TTTTCACGAT AACGATTTAG TGCCGATTGA TGCTAGTGCC
AGCCAAAAAG CCCAAATTAT TGCCGATTTC AAACAAGCAC TCAAGGATAC CGGCTTGGTT
GTGCCGATGG CAACCACCAA TTTATTCGGT CACCCAGCCT TTAAAGATGG CGCATTTACC
AGCAACGATC CGGCTGTGCG GGCTTATGCC TTGCAAAAAA CCATGGCAGC CATGGATTTG
GGCGCTGAAT TTGGCGCGAA AACCTATGTG TTTTGGGGTG GCCGTGAAGG CAGCGAAACC
GATGCCTCGA AAAATCTGCT CGAAGGCTTG AAGTGGTTCC GCGAAGCGCT CAACTTCTTG
TGCGACTATA GCAATGCCCA AGGCTATGGC TATCGTTTTG CCTTGGAAGC CAAGCCCAAC
GAACCACGCG GCGATATCTT CTTGCCTACC ACCGGAGCCA TGTTGGGCTT TATTCAGACC
CTCGATCAGC CCGAGATGGT GGGGGTAAAT CCCGAAGTTG CCCATGAAAC CATGGCAGGC
TTGAATTTTA CCCATGCCGT GGCCCAAGCG CTTGATGCTG GCAAACTGTT CCATATCGAC
CTCAACGATC AAAATAGTGG TCGCTACGAC CAAGATTATC GCTTTGGAGC ACAAAACTAC
AAACAAAGCT TTTTCTTGGT GAAATTGCTG CAAGATGCTG GCTACGATGG CCCATTGCAC
TTCGATGCTC ACGCTTACCG CAGCGAAGAT CTTGAGGGAG TTAAAGATTT TGCCCGTGGT
TGTATGCGTA CCTACCAAAT TTTGGCCGAA AAAGTTCAGC GCTTCAATGC TGATGCTGAA
ATTCAAGCCT TGTTAGCTCA AATCAACGCC CCAAATGCCG ATGTTGAGCA ATTCCGTGGT
GGCTACACGC CAGAACGTGC CGCTGCGCTC AAAGCCTATC AATTTGATCG TCAAGCACTT
GGCGAACGCG GCCTCGGCTA CGAAAAGCTT GATCAACTAA CCTTCGAGTT GTTGATGGGA
GCCAGATAG
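
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before the sequence is measured or analysed. The following is a rough Python sketch of how the length and GC content of such a listing could be computed. The helper names `clean_sequence` and `gc_content` are hypothetical and not part of any database tool, and only the first line of the listing is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, used here as sample input.
first_line = clean_sequence(
    "ATGACACAGC ATAAATTTAG CTTTGGATTA TGGACAGTTG GCAATGTTGG GCGTGACCCA"
)
print(len(first_line))
print(round(gc_content(first_line), 1))
```

Passing the complete listing to the same two functions would give the values to compare with the Gene Length and GC content fields.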
 
Protein sequence
MTQHKFSFGL WTVGNVGRDP FGEPVRKTLS PVEIVHLLAE VGAWGVNFHD NDLVPIDASA 
SQKAQIIADF KQALKDTGLV VPMATTNLFG HPAFKDGAFT SNDPAVRAYA LQKTMAAMDL
GAEFGAKTYV FWGGREGSET DASKNLLEGL KWFREALNFL CDYSNAQGYG YRFALEAKPN
EPRGDIFLPT TGAMLGFIQT LDQPEMVGVN PEVAHETMAG LNFTHAVAQA LDAGKLFHID
LNDQNSGRYD QDYRFGAQNY KQSFFLVKLL QDAGYDGPLH FDAHAYRSED LEGVKDFARG
CMRTYQILAE KVQRFNADAE IQALLAQINA PNADVEQFRG GYTPERAAAL KAYQFDRQAL
GERGLGYEKL DQLTFELLMG AR
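
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal Python sketch, assuming the listed gene sequence is already in the coding orientation. Translation table 11 assigns the same amino acids to codons as the standard genetic code, so a standard codon table is built here. The `translate` helper is hypothetical and stops at the first stop codon.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final block of the gene sequence listing.
print(translate("ATGACACAGC ATAAATTTAG CTTTGGATTA"))  # MTQHKFSFGL
print(translate("GCCAGATAG"))  # AR
```

The two fragments reproduce the first ten residues (MTQHKFSFGL) and the last two residues (AR) of the protein sequence, with TAG as the stop codon.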