Gene GM21_3947 details

Gene Information

Locus tag: GM21_3947
Symbol:
ID: 8139321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4531531
End bp: 4532532
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 64%
IMG OID: 644871564
Product: citrate lyase ligase
Protein accession: YP_003023722
Protein GI: 253702533
COG category: [C] Energy production and conversion
COG ID: [COG3053] Citrate lyase synthetase
TIGRFAM ID: [TIGR00124] [citrate (pro-3S)-lyase] ligase
            [TIGR00125] cytidyltransferase-related domain
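
The coordinate and length fields in this record agree with each other, which a short sketch can confirm. The function names here are hypothetical, and the calculation assumes inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length(4531531, 4532532)
print(length, protein_length(length))  # 1002 333
```

Both values match the Gene Length and Protein Length fields listed for GM21_3947.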


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.0000190715
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGTCACTT CCCTTCTATC AACAGATGAT CTGAAGCAGG CGCAGGCACT GGTAGCGGAA 
AGCGGCCTGC GCTTCGAGCT TCCCTACGAC GACCTGGTCG GGGTGTTCGA GGCCGGGCGG
CTGGTGGCGG TCGGCGCCAG GCAGGGGCGC GTACTCAAGA TGCTCGTCGT GGCGCCCGCG
CACCAGGGGG GGAGCCTGCT CGACGAGGTG GTGACGGAGC TGGTGGGACG CGGCTACCAG
GAGGGGATGG ATTCCTTCTT CGTCTTCACC TCGCCGGCAC TCGCCCCGAG CTTCGAATCG
CTCAACTTCA ACCTTTTGGT CACCTCGGGA AAGACCGCGC TTTTGGAGTA CGGCGACGGG
CTGAGGCGCT ACCTGGCCCG GTACAGCCGG CAGGTGTTTC CCGGCAACAA CGGCGCCGTG
GTGGCCAACT GCAACCCCTT CACCGTGGGG CACCGCTACC TGGTGGAAGA AGCCGCCTCC
GTCGTCGACC ATCTCTACCT CTTCGTGGTG CGCGAGGAGC GCTCCCTGTT CCCATTCCCG
GCGCGCCTGC GGATGGTGCG GGAAGGGACC GCCGATCTGA AAAACGTCAC CGTCCTCGAT
ACCTCCTGGT ACGCAGTCTC CAGCGTCACC TTCCCCACCT ATTTCCTGAA ATGCGACGAC
CCGGTGGGCG CCATCCAGAT GGAGCTCGAC CTGCTCCTCT TCGCCACCCG CATCGCGCCC
TATTTCCATA TCGCCACCCG CTTCATCGGC TCCGAGCCGT TCAGCCGCAC CACGGCGGAA
TATAACCGCG CCATGCACAG GATTCTCCCC CCAATGGGGA TCGGGGTGCG GGAGCTGGAA
AGAAAGAGCG CCTTTGGCGC GGCGGTGAGC GCCTCAAGGG TGCGGGAGAT GCTGATGGCA
GGCGAACTGG AGGGGATCGC CGAGCTGGTG CCGGTGAGCA CGCTCGATTT TCTCCTCTCC
AGCGAGGGGA TCAAGATCTG GGACAAAGGG GGGAGCAAAT GA
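
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before any calculation such as GC content. A minimal sketch follows, assuming the sequence is available as plain text; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record (6 blocks of 10 bases).
first_line = "ATGGTCACTT CCCTTCTATC AACAGATGAT CTGAAGCAGG CGCAGGCACT GGTAGCGGAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions over all lines of the gene sequence would give a value to compare with the GC content field of the record.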
 
Protein sequence
MVTSLLSTDD LKQAQALVAE SGLRFELPYD DLVGVFEAGR LVAVGARQGR VLKMLVVAPA 
HQGGSLLDEV VTELVGRGYQ EGMDSFFVFT SPALAPSFES LNFNLLVTSG KTALLEYGDG
LRRYLARYSR QVFPGNNGAV VANCNPFTVG HRYLVEEAAS VVDHLYLFVV REERSLFPFP
ARLRMVREGT ADLKNVTVLD TSWYAVSSVT FPTYFLKCDD PVGAIQMELD LLLFATRIAP
YFHIATRFIG SEPFSRTTAE YNRAMHRILP PMGIGVRELE RKSAFGAAVS ASRVREMLMA
GELEGIAELV PVSTLDFLLS SEGIKIWDKG GSK
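
The protein sequence can be related back to the gene sequence by translating codons. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code for all sense and stop codons; table 11 differs in the start codons it permits, and this sketch does not handle alternative starts (a non-ATG start codon would be translated literally rather than as M). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGGTCACTT CCCTTCTATC AACAGATGAT"))  # MVTSLLSTDD
```

The result matches the first 10 residues of the protein sequence listed in the record.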