Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1167 |
Symbol | |
ID | 6743984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 1079852 |
End bp | 1081066 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642750977 |
Product | protein of unknown function DUF224 cysteine-rich region domain protein |
Protein accession | YP_002121831 |
Protein GI | 195953541 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGAAA AACATTTTGA AATACCCATA GATTTGCTAC AAAAGTGCGT TAGATGTGGT CTTTGTAAAT CGGTATGCCC TACATATAGG GTAAATGAAG AAGAAAGAAG CTTTGCAAGA GGAAGACTAG CTTTGGCTCA GATGGTCCTA TCTGGAGAAC TGCCTCTTAC CAAAGATGTG GCAAGACAGT GGGACGAATG CGCCATGTGT AGAAGATGTG AGTGGATCTG TCCAAACAAC GTAGAGTACA AAGAGATTAT GTCAAAAGCC AAAGAGCTTC AATCAAACAC CCTTGGCAAA GATCCATTCA AATACACCGC TCTAAGCGCT CTTGAGCTAA TGCAAACAAA CGTTGGTAGA ACTGTCGTAA AAATGGCTGG TTCTTTATTA TCTTTAATCC CCAAAAAAGA ACTAAAAACT TACATACCTT CTGGTATAAA CGGTGCTGTA AAGTTCATGC CAAAACCCTC CAAAGACGCT TTTGGTATAA GAGGACAAAC CTTTAAAACC GATAAAACCC CTTCCAAAGG CACACTTCTT TTCTTTACCG GTTGCATGGT AGATGCTTTT TACACTACCA CTGGTAAAAA TGCCATAAAA GTCTTGAACA AAGCAGGTTA TGATGTAATT GTACCAAAGG ATATAAAGTG TTGCGGTGCT CCTCATCTTT ACTCTGGGAA TATTGAGGCT TTTAATATAC TAAAAGCCAA AAACCAAGAG GAGATTTCAA AATACAACTT TGATGCTATA GTGGTAGTTT GTCCCACCTG CGGTGGTGCA CTGTTGGAAG ATTATGGATA TAAAAATGTT TTAGATTTTG CAAGCATTGT AGCGTCTTCA AACGATCTTG TTTTAAAGTC AAAATCAAAA GAATCTGTCA CTTTCCATGT ACCCTGCCAC TCTTACAGCG CTATGAAAAC ACCGGTATCC GATTTTGAAA ACACCATAAA GAAAATAGAA AATGTAGAGT ACAATAAAGC TTCTAAAGCT CAAAGCTGTT GTGGTTTTGC TGGGCTTTTT TCCATGAAAA ACCCGGAGCT TTCTACAGCT ATTCAAAAAG AGAAAATGGA AGACTTCAAA TCCACAAACG CAGAATATAT ACTAAGCGCT TGTCCTGGAT GTGTTTTACA GCTTCAAGAT GGCAACCTTA AGTTTAAAAA TAATCAAAAG ATCATGCATA TAGCAGATTT TGTGGCAAAC AAGTTAGAAG ATTGA
|
Protein sequence | MEEKHFEIPI DLLQKCVRCG LCKSVCPTYR VNEEERSFAR GRLALAQMVL SGELPLTKDV ARQWDECAMC RRCEWICPNN VEYKEIMSKA KELQSNTLGK DPFKYTALSA LELMQTNVGR TVVKMAGSLL SLIPKKELKT YIPSGINGAV KFMPKPSKDA FGIRGQTFKT DKTPSKGTLL FFTGCMVDAF YTTTGKNAIK VLNKAGYDVI VPKDIKCCGA PHLYSGNIEA FNILKAKNQE EISKYNFDAI VVVCPTCGGA LLEDYGYKNV LDFASIVASS NDLVLKSKSK ESVTFHVPCH SYSAMKTPVS DFENTIKKIE NVEYNKASKA QSCCGFAGLF SMKNPELSTA IQKEKMEDFK STNAEYILSA CPGCVLQLQD GNLKFKNNQK IMHIADFVAN KLED
|
| |