Gene GM21_3990 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3990 
Symbol 
ID8139364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4573008 
End bp4574372 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content60% 
IMG OID644871606 
Producttype I citrate synthase 
Protein accessionYP_003023764 
Protein GI253702575 
COG category[C] Energy production and conversion 
COG ID[COG0372] Citrate synthase 
TIGRFAM ID[TIGR01793] citrate (Si)-synthase, eukaryotic 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.000000256962 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACACAGC TGAAAGACAG GCTAAAGGAA AAGATCGAGG CTCACCGTCC CCGCATCGCC 
CGGCTCACTA AAGAGTTCGG CTCAGTCATA ATCGACAAGG TCGATATAGC GCAATGCATC
GGCGGCGCCC GCGATATTAG GTCCCTTGTT ACTGACATCT CCTATCTTGA TCCGCAGGAA
GGGATCCGTT TCAGAGGCAA GACCATCCCC GAGACTTTCG AGGCCCTCCC CAAGGCAGCC
GGTTCAGAGT ATCCCACAGT GGAATCGTTC TGGTATTTCC TGCTCACCGG CGAGGTTCCG
ACCCCCGAGC AGGTGCAGGA CGTGGAAGCC GAATTCAAGA CGCGACAGCA GGTTCCGGAG
TACGTGTTCC AGTCCTTGCG GGCGCTCCCG CTGGACAGCC ACCCAATGGT GATGCTCGCC
TCCGGCATCC TCGCCATGCA AAGGGATTCC AAGTTCGCAG CCTTCTACAG CAGCGGCAAG
TTCAACAAAA TGACGGCCTG GGAGCACGTC TACGAGGACG CCAGCGACAT CGTGGCCCGC
ATCCCGGTAC TGGCAGCATT CATCTACAAC CTCAAGTACC GGGACGACAA GCAGATCTCC
ATCGACCCGA AGCTGGACCT GGGCGCCAAC TTCGCCCAGA TGATAGGGCA GAGCGAGCAG
TACAAGGATG TGGCACGCAT GTACTTCATC CTCCACTCCG ACCACGAGTC GGGCAACGTC
TCGGCCCACG CCACCCACCT CGTCCACTCT GCCCTTTCCG ACCCCTATTA CGCCTATGCC
GCAGGTCTCA GCGGCTTGGC CGGCCCTCTT CACGGCCTGG CGAACCAGGA GGTACTAGGG
TGGATCCTGG AATTCCAGAA GAAGCTCAAC GGCGCCGAGC CGACCATGGA AAACGTCACG
GCGGCTCTTT GGGACACCCT CAATGCCGGG CAGGTGGTCC CGGGTTACGG GCACGCCGTC
CTCAGGAAGA CCGACCCGCG CTACATGGCC CAGCGCGAGT TCTGCCTGAA GACCACGGGG
CTTAAGGACG ACAAGCTCTT CAAGCTGGTC TCCATGATCT TCGAGACCGC TCCGGGTGTC
CTTACCGAAC ATGGCAAGAC CAAGAACCCG TGGCCCAACG TGGATGCGCA ATCGGGCGTG
ATCCAGTGGT ACTACGGGCT GAAAGAATGG GATTTTTACA CGGTGCTCTT TGGAGTGGGG
CGCGCCTTGG GATGCATGGC GAACATCACG TGGGACCGTG GCCTTGGCTA CCCCATCGAG
CGACCCAAAT CCGTCACCAC CGAGATGCTG GAGACCTGGG CTGCGGCAGG TGGACGGGAT
ATCACAGCCG CCACAATTCA GCAACCGCCA AAGCCAACTG CGTAG
 
Protein sequence
MTQLKDRLKE KIEAHRPRIA RLTKEFGSVI IDKVDIAQCI GGARDIRSLV TDISYLDPQE 
GIRFRGKTIP ETFEALPKAA GSEYPTVESF WYFLLTGEVP TPEQVQDVEA EFKTRQQVPE
YVFQSLRALP LDSHPMVMLA SGILAMQRDS KFAAFYSSGK FNKMTAWEHV YEDASDIVAR
IPVLAAFIYN LKYRDDKQIS IDPKLDLGAN FAQMIGQSEQ YKDVARMYFI LHSDHESGNV
SAHATHLVHS ALSDPYYAYA AGLSGLAGPL HGLANQEVLG WILEFQKKLN GAEPTMENVT
AALWDTLNAG QVVPGYGHAV LRKTDPRYMA QREFCLKTTG LKDDKLFKLV SMIFETAPGV
LTEHGKTKNP WPNVDAQSGV IQWYYGLKEW DFYTVLFGVG RALGCMANIT WDRGLGYPIE
RPKSVTTEML ETWAAAGGRD ITAATIQQPP KPTA