Gene Mext_4680 details


Gene Information

Locus tag: Mext_4680
Symbol:
ID: 5834266
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 5229983
End bp: 5231323
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 63%
IMG OID: 641370475
Product: citrate synthase I
Protein accession: YP_001642119
Protein GI: 163854076
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01798] citrate synthase I (hexameric type)
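
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and do not come from the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5229983
END_BP = 5231323
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1341
print(expected_protein_length(GENE_LENGTH_BP))  # 446
```

Both results agree with the reported Gene Length (1341 bp) and Protein Length (446 aa).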


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCCG GTCCAGTGCT GTTCCCGTCG CAAGAGAGGG TCCGCGATCC CATGAGCGCT 
TCCGCCAGCA CCATCATCGT GGGCGACAAA AACGTCGAAT TGCCCATCAA GACCGGAACG
ATCGGCCCCG ACGTGGTCGA TATCGGCAAG CTCTACGCCC AGACCGGCAA GTTCACCTTC
GATCCGGGCT TCACCTCGAC CGCCTCCTGC GAGTCGAAGA TTACTTACAT CGACGGCGAC
GAGGGCGTGC TGCTCTATCG CGGCTACCCG ATCGAGCAGC TCGCCGAGAA CGGCGACTTT
CTTGAGACCG CCTACCTGAT GCTGTTCGGC GAGTTGCCCA GCTCGGCCCA GAAGGCGGAT
TTCGACTACC GGGTCACGCG CCACACCATG GTGCACGACC AGATGAACCG CTTCTTCCAG
GGTTTCCGCC GCGACGCCCA CCCGATGGCG GTGATGGTGG CCTGCGTCGG CGCGCTCTCG
GCCTTCTATC ATGACTCGAC CGATATCTCG GACGAGTCGC AGCGGATGAT CGCCTCGCTG
CGCATGATCG CGAAGATGCC GACGCTGGCG GCGATGGCCT ACAAGTACAC GATCGGCCAG
CCCTTCGTGT ATCCGAAGAA CGACCTCGAC TACACCTCGA ACTTCCTGCG GATGTGTTTC
GCGGTTCCGT GCGAGGAATA CGTCGTCAAC CCGATCTACG CCCGCGCGCT GGACAAGATC
TTCATCCTGC ACGCCGACCA CGAGCAGAAC GCCTCGACCT CGACGGTGCG TCTGGCCGGC
TCCTCGGGCG CCAACCCGTT CGCCTGCATC GCCGCCGGCA TCGCCTGCCT GTGGGGGCCG
GCCCATGGCG GCGCCAACGA GGCGGCGCTC AAGATGCTCA TGGAGATCGG GCATCCCGAG
AACGTGCAGA AATACGTCGC CAAGGCGAAG GACAAGAACG ATCCCTTCCG CCTGATGGGT
TTCGGCCACC GGGTCTACAA GAACTACGAC CCGCGTGCGC GCATCATGCA GAAGACGACC
CACGAGGTTC TGAACGAACT CGGGATCAAG GACGACCTCT TGGAAGTCGC CGTCCAGCTC
GAGAAGATCG CCCTCGAGGA CGAGTACTTC ATCGAGAAGA AGCTCTACCC GAACATCGAC
TTCTACTCGG GCATCACCCT CAAGGCGCTC GGCTTCCCGA CCTCGATGTT CACGGTGCTG
TTCGCGCTCG CCCGCACCGT CGGCTGGATC GCGCAGTGGG CCGAGATGAT CGAGGACCCG
TCCCAGAAGA TCGGCCGCCC GCGCCAGCTC TATATCGGCC CGGACCGCCG CGACTACACG
CCGATCGGCC AGCGGAGCTG A
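
The GC content field of this record can in principle be recomputed from the gene sequence. The sketch below shows one way to do so, assuming the sequence is pasted as a string with the 10-base blocks and line breaks still in it; `clean_sequence` and `gc_percent` are hypothetical helper names. It is demonstrated on the first 20 bases only, so the printed value is not the whole-gene figure.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above.
first_blocks = clean_sequence("ATGCCTTCCG GTCCAGTGCT")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 60.0
```

Passing the full 1341-base sequence to `gc_percent` gives the value to compare with the reported 63%.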
 
Protein sequence
MPSGPVLFPS QERVRDPMSA SASTIIVGDK NVELPIKTGT IGPDVVDIGK LYAQTGKFTF 
DPGFTSTASC ESKITYIDGD EGVLLYRGYP IEQLAENGDF LETAYLMLFG ELPSSAQKAD
FDYRVTRHTM VHDQMNRFFQ GFRRDAHPMA VMVACVGALS AFYHDSTDIS DESQRMIASL
RMIAKMPTLA AMAYKYTIGQ PFVYPKNDLD YTSNFLRMCF AVPCEEYVVN PIYARALDKI
FILHADHEQN ASTSTVRLAG SSGANPFACI AAGIACLWGP AHGGANEAAL KMLMEIGHPE
NVQKYVAKAK DKNDPFRLMG FGHRVYKNYD PRARIMQKTT HEVLNELGIK DDLLEVAVQL
EKIALEDEYF IEKKLYPNID FYSGITLKAL GFPTSMFTVL FALARTVGWI AQWAEMIEDP
SQKIGRPRQL YIGPDRRDYT PIGQRS