Gene HY04AAS1_1103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_1103 
Symbol 
ID6743918 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp1021245 
End bp1022606 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content40% 
IMG OID642750911 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002121767 
Protein GI195953477 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000141621 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGAAG ATTGGAGCCT AAGAAGTAAC TGGACCAATA AAACACAAAT GCACCTTGCA 
AGGCAGGATA TTATAAGCGA GGAAATGCGT TATGTGGCAA AGGTGGAAGG CCTTCACCCA
GAATTTGTGC GTCAAGAGGT GGCAAGAGGC CGTATGATAA TGCCTGCCAA CATAAACCAC
AAACACTTAA AACCTATGTG TATAGGCATA AATTCAAGAG TTAAAGTAAA TGCAAATATA
GGAAACTCAA AACTTGCATC AGATATACCC CAAGAGATAG AGAAAGCAAA AATCTCCATC
AAATATGGCG CAGATACCAT CATGGATCTA TCCACTGGTG AAGCTATAAA AGAAACTAGA
GAAGCTATTA TAAATGCAGT TGATGTGCCA ATAGGTACAG TGCCAATATA TGAAGCTTTT
AAGATAGCAA AAAATAGAAT AAAGGATATT ACCGAAGATC TTATTTTAGA TGTAATAGAA
GAACAAGCAA GGCAAGGTGT TTCTTATATG ACTATTCACG CTGGCGTGTT AAAGGAGTTT
ATACCCCTTA CTACTCATAG AGTTATGGGT ATAGTATCAA GAGGCGGTGC TCTCATGGCA
CAGTGGATGC TGGAGCATGG AAAGCAAAAC CCTCTTTATA CGAACTTTGA TAAAATATGT
GAAATATTTA AAAAATACGA TGTATCTTTT TCTTTAGGAG ATGGGCTAAG ACCTGGAGCC
ATAGCAGATG CATCAGATGA AGCTCAACTT TCAGAGCTTA TGGTGCTTGG AGAACTCACA
AAGAAAGCAT GGGAGCACGA TGTGCAGGTA ATGGTGGAAG GGCCTGGGCA TGTGCCAATG
AATCAAATAG AATTTAATAT GAAAATCCAA CAAAAATATT GTTATGAAGC ACCATTTTAC
GTGCTTGGTC CTTTGGTGAT AGATGTGGCA CCAGGTTATG ATCATATGGC TTCTGCTATA
GGTGCTGCAA TGGCTGGTTG GTATGGGGCC GCTATGCTTT GTTATGTGAC TCCAAAAGAA
CATCTTGGTT TACCTAATCT AGAAGATGTA AAACAAGGTC TTATAGCTTA CAAAATAGCT
GCTCACGCTG CAGATGTGGC AAAAGGTTTA CCAGGAGCCA GAGAATGGGA CTTAGAAATG
TCAAAAGCAA GATACGCTTT TGATTGGAAT CGCCAATTTG AACTTGCTAT AGACCCAGAA
ACCGCAAAAG CTTATCACGA TGAAACGCTT CCACAAGAGG GATACAAGAC TGCAAAATTC
TGTTCTATGT GTGGGCCTGA GTTTTGTTCT TACCGTATAT CCCAAAACGT GCAAACAAAC
TTTGAAGAGC AACTAGCTGA AGGTACATGG CAAACTCCTT GA
 
Protein sequence
MREDWSLRSN WTNKTQMHLA RQDIISEEMR YVAKVEGLHP EFVRQEVARG RMIMPANINH 
KHLKPMCIGI NSRVKVNANI GNSKLASDIP QEIEKAKISI KYGADTIMDL STGEAIKETR
EAIINAVDVP IGTVPIYEAF KIAKNRIKDI TEDLILDVIE EQARQGVSYM TIHAGVLKEF
IPLTTHRVMG IVSRGGALMA QWMLEHGKQN PLYTNFDKIC EIFKKYDVSF SLGDGLRPGA
IADASDEAQL SELMVLGELT KKAWEHDVQV MVEGPGHVPM NQIEFNMKIQ QKYCYEAPFY
VLGPLVIDVA PGYDHMASAI GAAMAGWYGA AMLCYVTPKE HLGLPNLEDV KQGLIAYKIA
AHAADVAKGL PGAREWDLEM SKARYAFDWN RQFELAIDPE TAKAYHDETL PQEGYKTAKF
CSMCGPEFCS YRISQNVQTN FEEQLAEGTW QTP