Gene Information |
Locus tag | HY04AAS1_0087 |
Symbol | |
ID | 6742870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 80215 |
End bp | 81468 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 642749871 |
Product | CBS domain containing protein |
Protein accession | YP_002120757 |
Protein GI | 195952467 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
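The coordinates and lengths in this record agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the gene length includes the stop codon. A minimal Python sketch of that check, under those assumptions (the function names are illustrative and not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(80215, 81468)   # Start bp and End bp from the record
print(length)                           # 1254
print(protein_length_aa(length))        # 417
```

Both values match the Gene Length (1254 bp) and Protein Length (417 aa) fields above.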
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000089448 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGTA TATTGCTTTA TAGCCTTTTA ATACTTTTTC TCATATTTTT ATCTGGTTTT TTTGCCTCTT CGGAGGTTGT GTTTTTTGGT GCTTCTAATA TATTAGAAGA AAAGGATAAA AAGTTTTATA AAAAACTTAT AGATAGAATA TTTAAAGACC CGCAAGCTCT TTTAGTGTCA ATGCTCATTG GGAACGAGTT TGTAAATGTT TTAATATCTT CAGTGAGCTC TAGTTTTGTA ATAAACCTGG TAGGCAAAAG ATGGGTATTT TTATCTGGCA TATTTATCAG TATATTGATA TTTTTAATTG GTGAAACTAT ACCTAAAAAT ATTGCTCTTT TCTTGAAAGA TAGGCTTTTA AAGTTTTACG CGATCATATT TTATCCATAT CTTGTAGCCA CAAAGCCTTT TACTTACATT TTTGTAGCAC CTGTAAAAAA GATATTAAAA CTTTTTGGTG TTGAAAATAT AAGCTTGGAA AAGAAGTTTT CTTTAGAGCA TATAGTTTAT ATAATGCAAA GTCCAGCAAA CGCTCAGGAA TTTTCAGAAG AAGAAACTCA GATGATACAA AAAGTATCTC AGATGAGAGA AACTATAGTA AGAGAAATAA TGACTCCAAG GCTTGATATA TTTATGTTAG AAGCAACCCA AACTGTAAAG GAAGTTATAA ACGATATATT AGAGCATGAA CATAGTAGGA TACCTATATA CAAAGATACA AAAGACAATG TGGTTGGCTA TATACATATA AAAGACCTTA CGCCAGTTTA TCAGCATAAA GACGATACGT TAGAGATTTT TTTAAGACCT ATTGAATTTA TACCAGAGGT TATGAGTATA AAAAATCTAC TTCAAGAGAT GAAAAAATCT TCAAGTCAAA TAATGATGGT GGTAGATGAG CACGGTGCCA TATCTGGACT CATTACGAAA CATGATTTGC TTGAATGGTT GGCAGGAGAT TTACCTCAGG AGTGGGAAGA TGAAAATGAG TTTACCAAGA TGTCTTCAGA TGTTTATATA GTGGAAGGTT CTGCCTCCAT AGAAGAAGTC GCCGCTACCG TTGGTTTTGA ACTCTCTGAG AACTACGATT ACGATACGTT GAGCGGTTTT ATAATGGCAA ACATGGGGAA AATTCCAAAA GAAGGCGATG AGTTTGAATA CAAAGGTTTT AAGTTTATAG TGGATAAAAT GGATGGTAAA AAGATAGAGC ATGTACTTAT CAAAATACCT ACGGATAAAG AGCCAAATAG CTAA
|
Protein sequence | MNSILLYSLL ILFLIFLSGF FASSEVVFFG ASNILEEKDK KFYKKLIDRI FKDPQALLVS MLIGNEFVNV LISSVSSSFV INLVGKRWVF LSGIFISILI FLIGETIPKN IALFLKDRLL KFYAIIFYPY LVATKPFTYI FVAPVKKILK LFGVENISLE KKFSLEHIVY IMQSPANAQE FSEEETQMIQ KVSQMRETIV REIMTPRLDI FMLEATQTVK EVINDILEHE HSRIPIYKDT KDNVVGYIHI KDLTPVYQHK DDTLEIFLRP IEFIPEVMSI KNLLQEMKKS SSQIMMVVDE HGAISGLITK HDLLEWLAGD LPQEWEDENE FTKMSSDVYI VEGSASIEEV AATVGFELSE NYDYDTLSGF IMANMGKIPK EGDEFEYKGF KFIVDKMDGK KIEHVLIKIP TDKEPNS
|
| |
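As a rough check on the Sequence fields, the gene sequence can be translated and its GC percentage computed once the spaces that split it into 10-base blocks are removed. The sketch below is an assumption about how such a check could look, not the pipeline that produced this record, and the helper names are hypothetical. It relies on translation table 11 assigning the same amino acids to codons as the standard code (the two differ in permitted start codons), and it uses only the first 30 bases of the gene as sample input.

```python
# Codon table built from the compact TCAG ordering; the amino acid
# assignments are those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups the sequence into blocks and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_30 = "ATGAATAGTA TATTGCTTTA TAGCCTTTTA"
print(translate(first_30))  # MNSILLYSLL
```

The output matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.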