Gene Information |
Locus tag | Hhal_2251 |
Symbol | |
ID | 4709494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2471020 |
End bp | 2472324 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639856727 |
Product | citrate synthase I |
Protein accession | YP_001003817 |
Protein GI | 121999030 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01798] citrate synthase I (hexameric type) |
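The coordinate and length fields above are related by simple arithmetic, sketched below as a consistency check. It assumes the start and end coordinates are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2471020
END_BP = 2472324


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1305, matching "Gene Length"
    print(protein_length(length_bp))  # 434, matching "Protein Length"
```
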
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGTCGA AAACCGTCAC CGTCACCGAC AACACCACCG GTCACAGCAT CGAGATGCCC ATCCGCGAGG GGACGCATGG GCCGGGCGTC GTCGATATCC AGAACCTCTA CAAGGAGTTC GGATACTTCA CGTACGATGC CGGCTTCACC TCGACCGCGA GCTGCCGCAG CGACATCACC TTCATCGATG GTGAGAACGG CATCCTGCTC TATCGCGGTT ACCCCATCGA AGAGCTCGCC GAGAACAGCA ACTTCCTCGA GGTCTCCTAC CTCCTGCTCC ACGGCGAGCT GCCGACCAAG GACGAGCTGG ACCAGTTCAA CCGGGCGGTC ACCGAGCACG CCATGATCAA CGAGTCGTTG AAGGACTTCT TCGACGGCTT CCACTACAAC GCTCACCCCA TGGCCATGCT CACCGGCGTG GTGGCGTCCC TGTCGGCCTT CTACCACGAC GCCATCAAGC TCGGCGACCC GCGCAACCGG ATGCTCACCT GCCACCGCGT GCTGGCCAAG ATGCCGACCA TCGCCGCCGC GGCCTACAAG CACATGGCAG GCGAGCCGTT CGTCTACCCG CTCAACCGGC TCTACTACAC CGAGAACCTG CTCAACATGC TGTTCTCGCG CCCCACGGAG CCGTACGAGA TCAACCCGAT CCACGCCAGG GCACTGGATC AGCTGCTGAT CCTCCACGCC GACCACGAGC AGAACGCCTC GACCTCCACG GTCCGCCTGG CCAGCTCCAC CGGCACCAAT CCGTTCGCCT CCATCGCCTG TGGTTGCGCT GCGCTGTGGG GGCCCGCCCA CGGCGGTGCC AACGAGGCGG TGCTGAACAT GCTCCACGAA ATCGGCGATA TCAGCCAAGT CCCGAAGTAC ATCGACAAGG CCAAGGACAA GGACGATCCC TTCCGTTTGA TGGGCTTCGG CCACCGGGTC TACAAGAACT TCGACCCGCG GGCGCGGATC ATCCGCAAGA CCTGCCACGA GGTCCTCAAC GACCTGGGCA TCGAGAACGA CCCGCAGCTG GAGCTGGCCA TGGAGCTCGA GGAGCGGGCG CTGGAGGACG ATTACTTCGT CGAGCGCAAG CTCTACCCCA ATGTCGATTT CTATTCGGGG ATTATCTATC GGGCCCTGGG CATCCCCACG GATTTCTTCA CGGTGATGTT CGCCCTCGGC CGCACCCCCG GGTGGTTAGC CCAGTGGAAC GAGATGGTGG CCGATCCGGA GCAGCGCATC GGGCGGCCAC GCCAGCTCTA CACCGGCGCG GCGCGGCGGG AGTACATCCC CGTCGATCAG CGCAAGAAGG GTTAA
|
Protein sequence | MTSKTVTVTD NTTGHSIEMP IREGTHGPGV VDIQNLYKEF GYFTYDAGFT STASCRSDIT FIDGENGILL YRGYPIEELA ENSNFLEVSY LLLHGELPTK DELDQFNRAV TEHAMINESL KDFFDGFHYN AHPMAMLTGV VASLSAFYHD AIKLGDPRNR MLTCHRVLAK MPTIAAAAYK HMAGEPFVYP LNRLYYTENL LNMLFSRPTE PYEINPIHAR ALDQLLILHA DHEQNASTST VRLASSTGTN PFASIACGCA ALWGPAHGGA NEAVLNMLHE IGDISQVPKY IDKAKDKDDP FRLMGFGHRV YKNFDPRARI IRKTCHEVLN DLGIENDPQL ELAMELEERA LEDDYFVERK LYPNVDFYSG IIYRALGIPT DFFTVMFALG RTPGWLAQWN EMVADPEQRI GRPRQLYTGA ARREYIPVDQ RKKG
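The sequence fields can be cross-checked against each other with a few lines of code. The sketch below strips the 10-base grouping spaces, computes GC content, and translates the CDS. It assumes every codon is read with the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in their permitted start codons). The helpers `clean`, `gc_content` and `translate` are hypothetical names introduced for illustration.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the grouping whitespace used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last codons of the gene sequence above.
    print(translate("ATGACGTCGA AAACC"))  # MTSKT
    print(translate("AAGAAGGGTT AA"))     # KKG
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the Protein sequence and GC content fields of this record.
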
|
| |