Gene Gmet_2375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGmet_2375 
Symbol 
ID3740052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter metallireducens GS-15 
KingdomBacteria 
Replicon accessionNC_007517 
Strand
Start bp2684673 
End bp2685740 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content61% 
IMG OID637779667 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_385325 
Protein GI78223578 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000189988 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAAGA CAAGCAACCT GAAGGTCACG AGCATTACCC CCATCATCGC CCCTGCCGAC 
CTGCGGCAAG TTTTCCCCCA ATCGCTGGAA ACAGCTGAAT TCGTTAATGC GAGCCGGGCC
CACATCAAGA ACATCCTCAA GGGGAAAGAC ACCCGCCTCA TGGTGGTGGT GGGCCCCTGT
TCCATTCACG ACCCCAAGGC CGCCCTCGAC TATGCGGGGC GCCTTGCGCG ACTCGCCAGC
GAACTCTCCG ACCAGCTTTT CATCGTGATG CGGGTCTACT TCGAGAAGCC CCGCACCACC
ATCGGCTGGA AGGGGCTCAT CAACGACCCC GACATGAACC ACACCCACCA GATCTCCAAG
GGACTCGGCA TCGCGCGGCG GCTCCTGAAC GACATCACCA GCATGCTCCT CCCCGTGGCG
TGCGAGATGC TCGATACCAT CACCCCTGAA TACCTGGCCG ACTATATCTC GTGGGGCGCC
ATCGGCGCCC GGACCACCGA GAGCCAATCG CACCGGGAGA TGGCGAGCGG CCTTTCCTTC
CCCGTGGGCT TCAAAAATGG CACTGACGGC AACCTGCAGA TCGCCATCGA CGCCATGAAC
GCGGCGCTCC ACCCCCACAG CTTCCTCGGC ATCAATCGGG ATGGCAAGAC CTCCATCATC
CAGACCACCG GCAACCCGGA CGTGCACATC GTCCTGCGTG GCGGCAAGAA GCCCAACTAC
TCTCCCGAGG ACATCGCCAA GACCGAAGAG ATGGTTGAAA AGGGGGGTAT CTTCCCGACC
ATCATGGTTG ATTGCAGCCA CGGCAACTCG GAGAAGCGCC ACGAGAAGCA GCCGGAGGTG
CTTGACAGCA TCGTCGACCA GATCGAGGCG GGCAATCGCT CCATCTCGGG GGTCATGATC
GAGAGCTTCC TCGAAGCGGG GAACCAGCCC ATTCCCAAGG ATCTGTCCCA ACTCCGCTAC
GGGGTCTCCA CCACCGACAA GTGCATCGAC TGGAAGACCA CCGAGGAAAT CCTGCGCAAG
GCCCATGAAC GGCTCAAGCG CTGCGGCGGA AGACCGATGC ACGGTTGA
 
Protein sequence
MTKTSNLKVT SITPIIAPAD LRQVFPQSLE TAEFVNASRA HIKNILKGKD TRLMVVVGPC 
SIHDPKAALD YAGRLARLAS ELSDQLFIVM RVYFEKPRTT IGWKGLINDP DMNHTHQISK
GLGIARRLLN DITSMLLPVA CEMLDTITPE YLADYISWGA IGARTTESQS HREMASGLSF
PVGFKNGTDG NLQIAIDAMN AALHPHSFLG INRDGKTSII QTTGNPDVHI VLRGGKKPNY
SPEDIAKTEE MVEKGGIFPT IMVDCSHGNS EKRHEKQPEV LDSIVDQIEA GNRSISGVMI
ESFLEAGNQP IPKDLSQLRY GVSTTDKCID WKTTEEILRK AHERLKRCGG RPMHG