Gene Clim_2042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2042 
Symbol 
ID6355546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2251422 
End bp2252765 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content51% 
IMG OID642669637 
Productcitrate synthase I 
Protein accessionYP_001944050 
Protein GI189347521 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01798] citrate synthase I (hexameric type) 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTTA CAGAGACAGG AAATTCGCTG ACAATCGTTG ATAACCGGAC AGGAAAATCT 
TATGAGGTTC CGATCGAAAA CGGTTCCATC AACACGATGG AACTTCGTAA AATCAAGGTT
TCCGAAGAGG ATTTCGGGTT GCTGGGATAC GATCCGGGGT ATCTGAATAC CGCATCCTGT
AAAAGCAGAA TTACCTACAT TGACGGCGAC AAGGGGATTC TTCGCTATCG GGGATACCCG
ATCGAGCAGC TTGCCGAAAA GAGCACGTTT CTTGAAACGG CATATCTGCT CATCAAGGGA
GAACTGCCCG ACAAGGAGCG TCTGGCGGTA TGGACCTACA ACATCCGCCA CCATACCATG
ACGCACAACA ATATCGTGAA ATTCATGGAC GGCTTCCGTT ACGACGCCCA TCCGATGGGA
ATACTGGTTG GAACGGTAGG CGCGCTCTCG ACCTTCTACC GCGACGCGAA GGATATCCGG
AACGAAGATT CCCGGAAACT GCAGGTTCGC AGGCTGATCG GCAAGATTCC GACGCTTGCT
GCCATGAGTT TCAGGCACAG CATGGGATTT CCCTATGTAA TGCCGGATAA TGATCTCAGT
TATGCGGGGA ACTTTCTTTC GATGATGTTC AAGATGACGG AGCTTCGATA CAAGCCGAAT
CCGGTACTTG AACGGGCTCT CGATGTTCTG TTCATTCTGC ATGCCGACCA TGAACAGAAC
TGTTCCACAA GCTCCCTGCG GGCTGTCGCA AGTTCAGGAG TCGATCCGTT TTCAGCTATT
GCTGCCGGTT GTGCAGCGCT CTACGGTCCG TTGCACGGCG GAGCGAACGA AGCGGTTATC
CGGATGCTTA TGAAGATCGG ATCGATCGAC AAAATACCGG AATTCATCCA ATCGGTAAAA
GACGGGGATG GCCGTCTGAT GGGCTTTGGT CACAGGGTGT ACAAGAATTA CGATCCGAGA
GCGAAGATTA TCAAGGATAT AGCATTCGAG GTGTTCGAGG AGACCGGCCG TAATCCGATG
CTCGATATTG CGCTTGAACT TGAGAGAATC GCTCTTGAGG ACGACTACTT TGTCAGCAGG
AAACTCTATC CGAATGTCGA TTTTTATTCC GGTCTTATTT ATCAGGCGAT GGGATTCCCC
ATGGATATGT TCCCGGTGCT GTTCGCAATA GGAAGAATTC CCGGATGGCT TGCCCAGTGG
ATCGAACATG TCAAGGACGA CGAGCAGAAA ATCGCCCGTC CCCGGCAGAT CTATCTTGGT
GAAGATGAGC GACAGTTCAT CGCTATGGCA GATCGTCCGA AAACAAGGCT TGACGAGCAG
ATGGCAGGGA TCTGCAGGCT TTAA
 
Protein sequence
MTVTETGNSL TIVDNRTGKS YEVPIENGSI NTMELRKIKV SEEDFGLLGY DPGYLNTASC 
KSRITYIDGD KGILRYRGYP IEQLAEKSTF LETAYLLIKG ELPDKERLAV WTYNIRHHTM
THNNIVKFMD GFRYDAHPMG ILVGTVGALS TFYRDAKDIR NEDSRKLQVR RLIGKIPTLA
AMSFRHSMGF PYVMPDNDLS YAGNFLSMMF KMTELRYKPN PVLERALDVL FILHADHEQN
CSTSSLRAVA SSGVDPFSAI AAGCAALYGP LHGGANEAVI RMLMKIGSID KIPEFIQSVK
DGDGRLMGFG HRVYKNYDPR AKIIKDIAFE VFEETGRNPM LDIALELERI ALEDDYFVSR
KLYPNVDFYS GLIYQAMGFP MDMFPVLFAI GRIPGWLAQW IEHVKDDEQK IARPRQIYLG
EDERQFIAMA DRPKTRLDEQ MAGICRL