Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RoseRS_0344 |
Symbol | |
ID | 5207279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 440745 |
End bp | 441551 |
Gene Length | 807 bp |
Protein Length | 268 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640593970 |
Product | histidine triad (HIT) protein |
Protein accession | YP_001274726 |
Protein GI | 148654521 |
COG category | [F] Nucleotide transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0537] Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.448559 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTTA TTTTCCATTC CCCGAAGAAT CCTCACAGCC ATATGACTGC AAAAGAACGC AACCGAGAGT CATTTCCGAT AAGGTACTTA TTTCAAAAGG GTTTGATAAA CGGTCGTGTT TTAGATTTTG GGTGTGGTCT GGGATCAGAT GTAAGGTTCC TTCGACAAAA AGGATACAAC GTTACTGGCT ACGATCCTCA TTATGCTCCA GACCAACCGA AGGGCAAGTT TGATACCATT GTTTGTATCT ATGTATTGAA CGTTCTTCTA CCGAAAGAGC AAGAGTACGT CCTGATGTCC ATTTCTGAAC TGCTTCTTCC AGAGGGAAGC GCATACTTTG CCGTTCGACG TGATATTCGA CGTGATGGTT TTCGTCTTCA CCTGAAACAT CGTGTAGAGG TGTATCAGTG CTCAGTTACG CTCCCCTATC AAAGTATTCT TCGCACCGAT CATTGTGAAA TATACCACTA TCGCCACTAT AATCAACTAT CTTCGCCTTC TACACCTTCG ACATGCCCAT TCTGCACACC TGCAAATGAT TATGAATTAC TGACCGAGTC TGCCAATGCA TATGCTGTGT TAGTCAGACG CCCGATTTCC CCTGGTCACA CGCTTGTCAT CCCCAAGAAA CACTTCACCC GTTATCTTCA AATTCCGCAT TATACCGTAG AATCTTACTG GTCAGTAGTA GAACGGGTAA AGCAAATACT AAACGAACGT TTTTACCCCA AACATTTCAA CATAAAAGTT GATACCGATA CAGTCGCAGG TCATGTATAT ATTCATATTA TCCCACAGTA CGCCTAG
|
Protein sequence | MELIFHSPKN PHSHMTAKER NRESFPIRYL FQKGLINGRV LDFGCGLGSD VRFLRQKGYN VTGYDPHYAP DQPKGKFDTI VCIYVLNVLL PKEQEYVLMS ISELLLPEGS AYFAVRRDIR RDGFRLHLKH RVEVYQCSVT LPYQSILRTD HCEIYHYRHY NQLSSPSTPS TCPFCTPAND YELLTESANA YAVLVRRPIS PGHTLVIPKK HFTRYLQIPH YTVESYWSVV ERVKQILNER FYPKHFNIKV DTDTVAGHVY IHIIPQYA
|
| |