Gene MCA2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA2021 
Symbolpgk 
ID3104829 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp2169482 
End bp2170657 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content65% 
IMG OID637171176 
Productphosphoglycerate kinase 
Protein accessionYP_114453 
Protein GI53803689 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0126] 3-phosphoglycerate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTCA AACGCATGAC CGACATCGAT CTCGCCGGCA AGCGCGTCCT GATCCGGGAA 
GACTTCAACG TCCCCGTCAA AGATGGCCGG GTCACCAGCG ACGCCCGCAT CCGCGCCGCC
CTCCCGACCA TCCGCCACGC GCTGGACGCC GGCGCTGCGG TGATGCTGAT GTCGCATCTC
GGCCGCCCCA CCGAAGGGGA ATATGCCGAG GAGTTTTCCA TGAAACCCGT TGCCGACCGG
CTGTCCGAAC TATTGGGGCA GCCGGTGACA CTGGTGAAGG ACTACCTGGG TGGCGCCGAC
CCGGCGGTCG GGTCGGTCGT GCTATTCGAG AACGTGCGTT TCAACAAGGG CGAGAAGAAG
GACGACGAGG TCTTGGCGCG CCAGCTCGCC GCGCTGTGCG ACGTCTACGT GATGGATGCC
TTCGGTACGG CGCACCGCGC CGAGGCTTCG ACGCACGGCG TGGGCAAATA CGCTCCCACC
GCCTGTGCCG GCCTGCTGCT GGCGACGGAA TTGGATGCAC TGGGCAGAGC GCTCCACGAC
CCGGCGCGCC CCTTGGTCGC CATCGTCGGC GGCTCGAAAG TATCGACCAA ACTGACGGTT
CTGGATTCCC TTTCGCAGGT GGTCGATCAG CTCATCGTCG GTGGCGGTAT CGCCAACACC
TTCATCAAAG CCGCCGGCTT CAATGTCGGG AAATCACTGT ACGAGGAGGA CCTGGTGGCC
GAAGCCAGGC GCCTGATGGA AGCCGCCAAG GCCAAGGGCG GGGAGATCCC CGTACCGGTC
GACGTGGTGG TCGGCAAACG CTTCGATGCC GCGGAACCCG CCATGGTCAA GAGCGTCGCA
GACATCGCCG AGGACGACAT GATCCTCGAC ATCGGTCCGG AGACCAGCCG CCGCTACGCC
GAGTTCATCG GCCGCGCCGG CACGGTGGTC TGGAATGGCC CCGTAGGTGT CTTCGAATTC
GACCAGTTCG GGGAAGGCAC CCGCCGATTG GGTCTGGCCA TCGCCGAAAG CCATGCATTT
TCCATCGCCG GAGGGGGAGA CACACTGGCA GCCATCGACA AATACGGCAT CGCCGACCGC
ATCTCCTACA TCTCGACGGG GGGCGGCGCC TTCCTGGAAT TTCTCGAAGG CAAGCAACTG
CCGGCGGTTG CCATGCTGGA GAGCCGCGCG GACTGA
 
Protein sequence
MAFKRMTDID LAGKRVLIRE DFNVPVKDGR VTSDARIRAA LPTIRHALDA GAAVMLMSHL 
GRPTEGEYAE EFSMKPVADR LSELLGQPVT LVKDYLGGAD PAVGSVVLFE NVRFNKGEKK
DDEVLARQLA ALCDVYVMDA FGTAHRAEAS THGVGKYAPT ACAGLLLATE LDALGRALHD
PARPLVAIVG GSKVSTKLTV LDSLSQVVDQ LIVGGGIANT FIKAAGFNVG KSLYEEDLVA
EARRLMEAAK AKGGEIPVPV DVVVGKRFDA AEPAMVKSVA DIAEDDMILD IGPETSRRYA
EFIGRAGTVV WNGPVGVFEF DQFGEGTRRL GLAIAESHAF SIAGGGDTLA AIDKYGIADR
ISYISTGGGA FLEFLEGKQL PAVAMLESRA D