Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1892 |
Symbol | |
ID | 4710690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2079079 |
End bp | 2080467 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639856365 |
Product | ThiS, thiamine-biosynthesis |
Protein accession | YP_001003458 |
Protein GI | 121998671 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATGC AAATAGAGCG GGGGGCCCGA AGTCGGTCCT CTGCGGCCAG TTGCGCAGAT CGCCCCGGGC GGGGTACTCC GCGCCGGCTC CAACAGTGCT CGCTCGGTGT CGTTACAGCG CTAACCATGG CGGTGTCGTC CAGTGTTTCC GCTGCGCCCG GATTGGTCAC GACGCCCGGC ATGGGCCTCG GCGACGGCTC CCACCCAGCG ACGCTGCACA CGGTTGCGGG TAACCCCGCA GGGGCGGCAG CAGCGGATCG GGTCGGTGTC CAGTTTGGGC TTGGCAGCGT AGGCGTGGGC TATGAACTCG GGCCGGTTGA TGGGCTGGTC GATGAAATCG ACGACATCAT CGACATCCTT GACCGGGATG ATCTGGGCCC GAGCGAGGCG GATGACCTGA TTGAACGTAC CGAGGGCGTC CTCGCTCAGC TGGGGCAGGA CGGTTGGGGG AAGCTCCAGT TTGGTGGCCG TCCGCCGCTT GCCCCGCTGG TTGTTGGCAG TGCCCGCTCC GGGTGGTCGG TGGCTCTGGA CGCTGAAGCC ACGGGCCACC TCGGGTTCAG CATACTCGAT GACGAGTTGC GGTTTGTCCG CGAGACCGAG CAAATCCAGA GCAATACCGC GGCATACCTG AAAGGCGGCG GTATCCTCCG GCTCTCTGCA GCCCCGAGCC TGCGCGTTGC AGAGTGGGAG GGAGGGCGCG AGCTGTTCGT CGGTGCTCGG GTCAGCCATT ACCAGGCGGA ACTCTCCAAG GCGGTGGTCG CCCTTGCCGA GGATACGAAC CGGGATTTCG GGGATATCGT CGAGGATGAG ATCGACCGCC AGCAGGAGAC CTCCAGCGCG GTTGGCCTCG ACCTGGGTGT TATGTACCAG ACCCGTTTCT TCCGTGCCGG CGGCGCCTGG AAGAACATCA ATGAACCCAC CTTCGACTTT CCGGCAGTTG GTACCGATTG CTCCGGGGAA GCGGACCCGG ATCTCCAGGC CAACTGCCTC ACGGCGGCCA ATTTCTCAGA CCGGATTACG CGCGAGGAGG CGTTTCGTCT CAACGAGCAG GTGACTCTGG AAGGCGCTCT CCACGATCCC GCCCAGCGGC TCGTCCTGGC CGCCAGTTAC GATGCGAATA CCGTTCGGGA TATCAGCGGC GATGAGTACC AGTGGCTGGC GTTCAGCCTC TCCTACCGCA TGCCGTGGTA TCTGAAGTGG GTTCCCGATC TCCGCGTGGG CTACCGGGAG AATATGAGCG GTTCGGAGCT GAGCTACACC ACTGCCGGCC TGACCTGGCT GGGTGCGGTG ACCCTGGACG TGGCCGTCGC GGATCAGGAC CTGGAACACG ACGGCGAATC GATCCCGCGT AGCGCGATGG CACACCTGGG CTTTCAATTG CGCTTCTGA
|
Protein sequence | MSMQIERGAR SRSSAASCAD RPGRGTPRRL QQCSLGVVTA LTMAVSSSVS AAPGLVTTPG MGLGDGSHPA TLHTVAGNPA GAAAADRVGV QFGLGSVGVG YELGPVDGLV DEIDDIIDIL DRDDLGPSEA DDLIERTEGV LAQLGQDGWG KLQFGGRPPL APLVVGSARS GWSVALDAEA TGHLGFSILD DELRFVRETE QIQSNTAAYL KGGGILRLSA APSLRVAEWE GGRELFVGAR VSHYQAELSK AVVALAEDTN RDFGDIVEDE IDRQQETSSA VGLDLGVMYQ TRFFRAGGAW KNINEPTFDF PAVGTDCSGE ADPDLQANCL TAANFSDRIT REEAFRLNEQ VTLEGALHDP AQRLVLAASY DANTVRDISG DEYQWLAFSL SYRMPWYLKW VPDLRVGYRE NMSGSELSYT TAGLTWLGAV TLDVAVADQD LEHDGESIPR SAMAHLGFQL RF
|
| |