Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0085 |
Symbol | |
ID | 6742868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 78216 |
End bp | 79223 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 642749869 |
Product | putative sulfate-binding protein |
Protein accession | YP_002120755 |
Protein GI | 195952465 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0725] ABC-type molybdate transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000372808 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG CTTTACTAGC TTTAAGTGTA ATGGCGTTAG GCGCCGCAGC TTTTGCAACC ACAGATTTAA ACCCTTATGA AATGCCACCG CACACACCAC CATGGAGCAG AAAACCAGTT GTTAGCCCAG ATGTTCACAG CGCCAATGAT CCAGCTGTTT ACAAGGGTTT TGATTTTACA ATACCCCCAG TGGATAATGT TGTGGACTTT CACGGCGATT TAAACGCTGC CAACAAAGAT GGTTTGGTAG TGTATGTAGG CGGTAACTAT TATTTTGCAG CTACCAAGCT TGTAAATGCT TTTGAAAAAG AATATCCTCA ATACAAAGGC AAGGTATTTA TAATAACAAT ACCTCCAGGT AAGCTTTTGC AAACTATAGA GCATTACCAC GATACATTTA CTTTGGGTGA TATGACTTTC ACTGCCAAAC CAGATATATT TGCTGGCGGT GCTTTTGGCG TGAAAAAATG TATAAAAGAT GGTTTTTGCG AAGCTTCTAC ATTTACCCCA TACGTTACAA ACACACTTGC TATAGAAGTC TACAAAGGCA ATCCAAAGCA TATACATAGC CTCCAAGACT TAGCAAGAAA AGATGTGAAA GTAATCATGC CAAATCCAAA GTTTGAAGGT ATAGCAAGAA GAATAGAAGA CGCCCTTGAA AAATGTGGTG GTAAGAAGTT AAAAGATGCC ATATTTGGTC CACACGGTAA GGCTATATTG ACAGAAATAC ACCACAGACA AACCCCACTT GCAATTATGG AACATATAGC TGATGCCGGT GTTGTATGGC ACTCTGAAGT AGAAGCTCAA ATGAAACAAT ACCACAATCC TCTTGAAGCT GTGCCAATAC CAGCAAAATG CAATTACACG GCTATATACG CTGCTGCTGA GGTTAAAGGA GCTCCACATC CACAAGCAGC TAAAGATTGG TTGCACTTCT TAAGGACACC AGCCGCTCTA AAGGTATTTG AATTCTACGG CTTCAAACCC TACGAAGGAA AAAGATAA
|
Protein sequence | MKKALLALSV MALGAAAFAT TDLNPYEMPP HTPPWSRKPV VSPDVHSAND PAVYKGFDFT IPPVDNVVDF HGDLNAANKD GLVVYVGGNY YFAATKLVNA FEKEYPQYKG KVFIITIPPG KLLQTIEHYH DTFTLGDMTF TAKPDIFAGG AFGVKKCIKD GFCEASTFTP YVTNTLAIEV YKGNPKHIHS LQDLARKDVK VIMPNPKFEG IARRIEDALE KCGGKKLKDA IFGPHGKAIL TEIHHRQTPL AIMEHIADAG VVWHSEVEAQ MKQYHNPLEA VPIPAKCNYT AIYAAAEVKG APHPQAAKDW LHFLRTPAAL KVFEFYGFKP YEGKR
|
| |