Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1580 |
Symbol | |
ID | 6744412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 1496096 |
End bp | 1497625 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642751400 |
Product | protein of unknown function DUF1703 |
Protein accession | YP_002122239 |
Protein GI | 195953949 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00740643 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTTT TACCAATAGG TATACAAACA TTCGAGAAGA TAAGAAATAG CAATTATTAC TATGTAGATA AAACAATATT TGTTAAAAAA TTAGAAAATG GTGGGTATTA CTTTCTATCT CGCCCCAGAA GATTTGGCAA ATCTCTTTTT CTTGACACAC TAAAAGAAGC GTTTTCTGGT AACAAAGAAT TATTTAAAGG GCTTTATCTA TATGATAATT GGGATTGGGA TAAGAAATAT CCTATTATAA AAATAAGCTT TGCAAGTGGA AATATAAGAA CTTCAGATAT TTTATTGGAT CTAATGACAT CTCAAGTAAA TAGGATATCA GAAAAAGAAC AGATAAACTT AAAAGAAAAG CGTCCAAGCC AAATGTTTTT AGAACTCATC CAAAAACTAT ACGAAAAATA CAATCAAAAA GTAGTAGTAC TTATAGACGA ATACGATAAA CCGATATTAG ATGCGATAGA GAATATAGAA GTAGCCAGGG AAAACAGAGA AATATTGAAA GATTTTTATT CGGTGTTAAA AGATGCTGAT CCTTATCTAA AACTTGTTTT TCTTACAGGG GTATCGAGGT TTTCAAAAGT AACAATCTTT AGTGGATTAA ATCAGCTTCA GGATATCACT TTAAATGAAG AGTTTTCTAC AGTGTGCGGT TATACCCAAT CTGAGCTTGA AAGTGTATTT GAAGACAGAT TAAAGGATTT TGATAAAGAA AAGATAAAAC AGTGGTATAA CGGCTATAGT TGGCTTGGAG AGAGTGTTTA CAATCCTTTT GATATACTGC TTTTGTTCTC AGAAAAAAGG TTTAGAGCCT TTTGGTTTGA AACAGGGACA CCTACATTTC TTATTAAGAT GTTTATGAAA AACAGATACT ACATACCAGA GCTTGAGAAC CTTGAAGTAG GAGACGAGAT TCTATCAAAC CTTGATGTGG ATAATATAAG GATAGAAAAT CTTTTGTTTC AATCTGGTTA TCTTACAATA AAAGATTTTA AAGAAAAATA CGGAATATAC ACATTGTCCT ATCCAAACTT AGAAGTGAGA AAAAGCTTCA ACAGTTATTT TCTAACCTAT ACAATAGAAG ATATCTCAGC AAAATATAAA ACTGATATAG GTCTAATAGA GGCTTTTGAG AATAAACAAG TAGAAAAACT AAAAGACATC TTACATAGAT TTTTTGCAAG TATACCCCAT GATTGGTACA GGAAAAACGA TATAGACTCC TATGAGGGAT TTTACGCATC TATCGTATAT GCGCTTTTTA ACGGAGCAGG GCTAAATGTA ATAGCAGAAG ACAATACAAA TAAAGGCCAG ATAGATCTTA GTGTCTTTAA CCAAGACAGC GTTTATATCA TAGAGTTTAA GGTAGTAGAA GACAAGGAAG AGGGCGTTGC TTTAAAACAG ATAAAAGATA AAAGGTATTA TGAAAAGTAT ATGGACAAAT ATAATGATAT ATACCTAATA GGGGTAGAGT TTAGTAAGAA AGATAAGAAC ATTGTGGGCT TTGAGTGGGA GAAGTACTGA
|
Protein sequence | MKLLPIGIQT FEKIRNSNYY YVDKTIFVKK LENGGYYFLS RPRRFGKSLF LDTLKEAFSG NKELFKGLYL YDNWDWDKKY PIIKISFASG NIRTSDILLD LMTSQVNRIS EKEQINLKEK RPSQMFLELI QKLYEKYNQK VVVLIDEYDK PILDAIENIE VARENREILK DFYSVLKDAD PYLKLVFLTG VSRFSKVTIF SGLNQLQDIT LNEEFSTVCG YTQSELESVF EDRLKDFDKE KIKQWYNGYS WLGESVYNPF DILLLFSEKR FRAFWFETGT PTFLIKMFMK NRYYIPELEN LEVGDEILSN LDVDNIRIEN LLFQSGYLTI KDFKEKYGIY TLSYPNLEVR KSFNSYFLTY TIEDISAKYK TDIGLIEAFE NKQVEKLKDI LHRFFASIPH DWYRKNDIDS YEGFYASIVY ALFNGAGLNV IAEDNTNKGQ IDLSVFNQDS VYIIEFKVVE DKEEGVALKQ IKDKRYYEKY MDKYNDIYLI GVEFSKKDKN IVGFEWEKY
|
| |