Gene GM21_3943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3943 
Symbol 
ID8139317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4528351 
End bp4529889 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content67% 
IMG OID644871560 
Productcitrate lyase, alpha subunit 
Protein accessionYP_003023718 
Protein GI253702529 
COG category[C] Energy production and conversion 
COG ID[COG3051] Citrate lyase, alpha subunit 
TIGRFAM ID[TIGR01584] citrate lyase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.000277042 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCTCA ATAGCCTGGG CCGCGAGATT CCGGAGAGCT ACGCGGGGAG AAGGCTCGTC 
CCCTACGGCG ACCCCTACTC GATCACCCCG AGCGGGTGCG TCGCCGCCCG CCGCCTGAGG
CGCGTGAACC CCGGCGCCTC CAAGCTCCTC TCCGGCCTCA GGGAGGCGAT CGAGGCGAGC
GGCCTGAAAG ACGGCATGAC CATCGCCACC CACCACAGCC TCCGAAACGG CGACTTCCTT
TTGAACCGGC TGGTGGCCGA GATCGCCCAG ATGGGGATCC GGGGGATCTG GATCGCCTCC
TCCTCGGTGC ACCCGGTGCA CGCAGAGATC ATCCCTCACA TCAAAAGCGG CGTGATAGCC
GGCTTCCAAT GTGGCGTGAA CGGCCTGATC GGCGAGATGG CGAGCCGGGG GGAACTCTCC
TGCCCCATCG TGGTCCGGAC CCACGGCGGC CGGGCCCGCG CCATCATGGA GGGGTCGGTG
CAGGTGGACG TCGCGTTCAT CGCCGCCCCC TGCTGCGACG AGTACGGCAA CATGAACGGC
TACAGCGGCC CCTCCGCCTG CGGCAGCCTG GGGTACGCCC AGACCGACGC CCTGCACGCC
GGCTGCGTCG TCGCCGTCAC CGACAACCTG GTCCCCTTCC CGGTGGTGCC GGTCAGCATC
CCGCAGACCC TGGTGGACTA CGTGGTGACG GTGGACCGGC TGGGGGACCC GGCGAAGATC
GTCTCCACCA CCACCAGGAT CACCACGGAT CCGGTCGGGC TCCAGATCGC CGGCTACGCC
TCACAGGTGA TCGAGGCCTC CGGCCTCTTG AAGGACGGCT TCTCCTTCCA GACCGGCAGC
GGCGGCATCT CTCTCGCCGT CTCGGACAAG GTGAGAGGCG CCATGCGCCG CGGCAACATC
AAGGGGAGCT TCGGCTGCGG CGGCATCACC GGATACTTCG TGGAGATGCT GGAAGAGGGG
CTCTTCGGCG CGCTGATGGA CGTGCAGTGC TTCGACCAGG AGGCGGTGAA GTCGATAGCG
AAAAACCGAG CCCACCAGGA GATCGGCGCC GACATGTACG CGAACCCCTT CAACGCGGGG
GCCGTGGTGA ACCGGCTCGA CTGCGTGATC CTCGGGGCGA CCGAGGTGGA CACCTCCTTC
AACGTCAACG TGAACACGGA GTCCAACGGC TACCTGCTGC ACAACACCGG CGGCCACTCC
GACACGGCCG CCGGGGCGAA GCTTTCCATC ATCGTGGCCC CCTCCATCCG CGGGCGCCTC
CCCATAGTGC GCGACCGGGT CACCACCGTC ACCACCCCCG GCGAGACCAT AGGCGTAGTG
GTGACCGAGC GGGGGATCGC GGTGAACGAC CGGCACCCCG AGCTCAAGGA GGAGCTTGTC
AGGAGGAAGC TGCCGGTCAA AGAGATCGGC GAACTGCAGC GCGAGATCTG CCGGGTGACC
GGCACCCCGC AGCCGCTTCA GTTCGAGGAC CAGGTGGTGG CGGTGATCGA GTACCGGGAC
GGGAGCATCA TCGATGTCGT TAGACGCGTT AAGGAATAG
 
Protein sequence
MALNSLGREI PESYAGRRLV PYGDPYSITP SGCVAARRLR RVNPGASKLL SGLREAIEAS 
GLKDGMTIAT HHSLRNGDFL LNRLVAEIAQ MGIRGIWIAS SSVHPVHAEI IPHIKSGVIA
GFQCGVNGLI GEMASRGELS CPIVVRTHGG RARAIMEGSV QVDVAFIAAP CCDEYGNMNG
YSGPSACGSL GYAQTDALHA GCVVAVTDNL VPFPVVPVSI PQTLVDYVVT VDRLGDPAKI
VSTTTRITTD PVGLQIAGYA SQVIEASGLL KDGFSFQTGS GGISLAVSDK VRGAMRRGNI
KGSFGCGGIT GYFVEMLEEG LFGALMDVQC FDQEAVKSIA KNRAHQEIGA DMYANPFNAG
AVVNRLDCVI LGATEVDTSF NVNVNTESNG YLLHNTGGHS DTAAGAKLSI IVAPSIRGRL
PIVRDRVTTV TTPGETIGVV VTERGIAVND RHPELKEELV RRKLPVKEIG ELQREICRVT
GTPQPLQFED QVVAVIEYRD GSIIDVVRRV KE