Gene Hhal_2251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2251 
Symbol 
ID4709494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2471020 
End bp2472324 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content63% 
IMG OID639856727 
Productcitrate synthase I 
Protein accessionYP_001003817 
Protein GI121999030 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTCGA AAACCGTCAC CGTCACCGAC AACACCACCG GTCACAGCAT CGAGATGCCC 
ATCCGCGAGG GGACGCATGG GCCGGGCGTC GTCGATATCC AGAACCTCTA CAAGGAGTTC
GGATACTTCA CGTACGATGC CGGCTTCACC TCGACCGCGA GCTGCCGCAG CGACATCACC
TTCATCGATG GTGAGAACGG CATCCTGCTC TATCGCGGTT ACCCCATCGA AGAGCTCGCC
GAGAACAGCA ACTTCCTCGA GGTCTCCTAC CTCCTGCTCC ACGGCGAGCT GCCGACCAAG
GACGAGCTGG ACCAGTTCAA CCGGGCGGTC ACCGAGCACG CCATGATCAA CGAGTCGTTG
AAGGACTTCT TCGACGGCTT CCACTACAAC GCTCACCCCA TGGCCATGCT CACCGGCGTG
GTGGCGTCCC TGTCGGCCTT CTACCACGAC GCCATCAAGC TCGGCGACCC GCGCAACCGG
ATGCTCACCT GCCACCGCGT GCTGGCCAAG ATGCCGACCA TCGCCGCCGC GGCCTACAAG
CACATGGCAG GCGAGCCGTT CGTCTACCCG CTCAACCGGC TCTACTACAC CGAGAACCTG
CTCAACATGC TGTTCTCGCG CCCCACGGAG CCGTACGAGA TCAACCCGAT CCACGCCAGG
GCACTGGATC AGCTGCTGAT CCTCCACGCC GACCACGAGC AGAACGCCTC GACCTCCACG
GTCCGCCTGG CCAGCTCCAC CGGCACCAAT CCGTTCGCCT CCATCGCCTG TGGTTGCGCT
GCGCTGTGGG GGCCCGCCCA CGGCGGTGCC AACGAGGCGG TGCTGAACAT GCTCCACGAA
ATCGGCGATA TCAGCCAAGT CCCGAAGTAC ATCGACAAGG CCAAGGACAA GGACGATCCC
TTCCGTTTGA TGGGCTTCGG CCACCGGGTC TACAAGAACT TCGACCCGCG GGCGCGGATC
ATCCGCAAGA CCTGCCACGA GGTCCTCAAC GACCTGGGCA TCGAGAACGA CCCGCAGCTG
GAGCTGGCCA TGGAGCTCGA GGAGCGGGCG CTGGAGGACG ATTACTTCGT CGAGCGCAAG
CTCTACCCCA ATGTCGATTT CTATTCGGGG ATTATCTATC GGGCCCTGGG CATCCCCACG
GATTTCTTCA CGGTGATGTT CGCCCTCGGC CGCACCCCCG GGTGGTTAGC CCAGTGGAAC
GAGATGGTGG CCGATCCGGA GCAGCGCATC GGGCGGCCAC GCCAGCTCTA CACCGGCGCG
GCGCGGCGGG AGTACATCCC CGTCGATCAG CGCAAGAAGG GTTAA
 
Protein sequence
MTSKTVTVTD NTTGHSIEMP IREGTHGPGV VDIQNLYKEF GYFTYDAGFT STASCRSDIT 
FIDGENGILL YRGYPIEELA ENSNFLEVSY LLLHGELPTK DELDQFNRAV TEHAMINESL
KDFFDGFHYN AHPMAMLTGV VASLSAFYHD AIKLGDPRNR MLTCHRVLAK MPTIAAAAYK
HMAGEPFVYP LNRLYYTENL LNMLFSRPTE PYEINPIHAR ALDQLLILHA DHEQNASTST
VRLASSTGTN PFASIACGCA ALWGPAHGGA NEAVLNMLHE IGDISQVPKY IDKAKDKDDP
FRLMGFGHRV YKNFDPRARI IRKTCHEVLN DLGIENDPQL ELAMELEERA LEDDYFVERK
LYPNVDFYSG IIYRALGIPT DFFTVMFALG RTPGWLAQWN EMVADPEQRI GRPRQLYTGA
ARREYIPVDQ RKKG