Gene Information |
Locus tag | HY04AAS1_0340 |
Symbol | |
ID | 6743134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 298658 |
End bp | 300331 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 642750133 |
Product | cytochrome d1 heme region |
Protein accession | YP_002121008 |
Protein GI | 195952718 |
COG category | [C] Energy production and conversion |
COG ID | [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | |
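The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(298658, 300331)
print(length)                     # 1674
print(protein_length_aa(length))  # 557
```

Both values agree with the Gene Length (1674 bp) and Protein Length (557 aa) fields of this record.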
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000596399 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTA GGTTAATTTT TCTTAGCTTG CTTCTTTGCT GTCCATTGCT ATCCTCCTTC TCCTCTCCAG CAAGTCTGGG GGGAGATTTA TACAAAGACA AGTGCGCTAA GTGTCACGGT ATTAATAGAA TGGGATTATC CGGGCCACCA TTAATTCCTG AATTCCTAAA ATCAAAATCT GACAACACAC TTTTTAAGAT TATAAAAAAC GGTATACCTC CTACGCTTAT GCCGTCCTTT AAAGATTTAA CAGACAAACA AATAGAAAGT CTCATTGCTT TTGTTAAGAC GCCCGTTACT GTGAACACCT GGAGTTTATC TGATATAAGG AAGACATGGT ATAAGTTAAG TGCTCAAAAC CCAAAGCCAT GGCATAGAAC AATAGTAAAT TTTATGGCTT TGGTGGAAGC TGGTAAGAAC AGGGTGTGGT TTCTAAATAA AGCCCATGTA ATAAACAAAA TAAATTTTGG AGACATCCAT GGGGGGCTAA AGTTTTCTAA CGATGGGAAG TTTATCTACA TTCCTTCAAG GACAGGATAT ATAGGTAGAT ACAACATAAG AAAAGGATAC TTGGATTATA AAGTAAGGGC ATGCTTATAC CAAAGAAATG TAAGCTTATT TTCCAACGGG GTAATATCAG CATGTTGGTT ACCTTCTCAA ATAGTAATCC TTAACAAAAA ATTAGAACCA AAAGCCGTAA TTCCAATAGA TGGCAAAATT AGTGCTATCT ATGGGTTATA CAATTCTCCT TACGCCGTGT TTGGTTTAAT GCATAAAAGT GTATTGGGTT TTCTCAACAC AAGCTCTTTA AACCTAAAAT ACTACGATGT TCCTACTGCT TTTGAAGACT TTTTCATAGA TCCGTTGGAG CATTTCGTAG TGGGAACTTC ATTTAATCAC AACGTACTAT CAGTATTTGA TATAAAATCC AAAAAAATTG TATATCAAAC CTCCTTTGGA GGGATGCCTC ATCTTGCTTC TGCAGCCTTT TGGTATAAAG ATAAAAATTT TTACTTTGCC ACCCCAGATT TAAAGAAACC AGTAATCACC ATATGGAGAG CCTACAAATG GAAAAAGATA AAAGAGATTC CTTCAAAAGG TGTTGGATTT TTTGTTAAAA CCAATCCAAA CACACCTTAT CTATGGGCTG ATGAACATGC AAACACGTTA CTTCTAATAA AGAAGACCGA TTTTAAACCC ATCCATGTAA AACTAGTTAA GAAAGGTTGG ATAATCCACA CCCAGTTTTC GGGAGATGGT AAATATGCCT ATGTAAGCGA TTATGCAAAA AATGGTAACG TTTTTATACT TGATTCAACG ACTCTTAAAC TTATAAAATC TTTTCCAGCA TCCTATCCCA TTGGAAAATA CAACTACGTT ACGTACTCAA ACAGAAGAGA AGCAGCCCTT TTAGGTGAAG AGGTGTATTT ACAATATTGC TGGGGATGTC ATCACCCCAC TAGAACCGCC TTTGCCCCTT CTTTTAAATA TATAGCAAAG CATATACCCA TATCTTTAAT AAAAGCTCAG ATACTAAGCC CAGATCAAAC ATATAAGCTG TTGGGATTTA AGCAAAATGT TATGCCAAAG TTCCACCTGT CAAAATACGA GATAGACGCC TTGCTGATGT TTATTGAATA TTCTAAAAAT AGAAACTTTT GGGATAGCCA CTGA
|
Protein sequence | MRIRLIFLSL LLCCPLLSSF SSPASLGGDL YKDKCAKCHG INRMGLSGPP LIPEFLKSKS DNTLFKIIKN GIPPTLMPSF KDLTDKQIES LIAFVKTPVT VNTWSLSDIR KTWYKLSAQN PKPWHRTIVN FMALVEAGKN RVWFLNKAHV INKINFGDIH GGLKFSNDGK FIYIPSRTGY IGRYNIRKGY LDYKVRACLY QRNVSLFSNG VISACWLPSQ IVILNKKLEP KAVIPIDGKI SAIYGLYNSP YAVFGLMHKS VLGFLNTSSL NLKYYDVPTA FEDFFIDPLE HFVVGTSFNH NVLSVFDIKS KKIVYQTSFG GMPHLASAAF WYKDKNFYFA TPDLKKPVIT IWRAYKWKKI KEIPSKGVGF FVKTNPNTPY LWADEHANTL LLIKKTDFKP IHVKLVKKGW IIHTQFSGDG KYAYVSDYAK NGNVFILDST TLKLIKSFPA SYPIGKYNYV TYSNRREAAL LGEEVYLQYC WGCHHPTRTA FAPSFKYIAK HIPISLIKAQ ILSPDQTYKL LGFKQNVMPK FHLSKYEIDA LLMFIEYSKN RNFWDSH
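The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is illustrative only and assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted; this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the block uses only the Python standard library.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon.

    A trailing incomplete codon, if any, is ignored.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGAATTA GGTTAATTTT TCTTAGCTTG"))  # MRIRLIFLSL
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields of this record.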