Gene MCA1076 details


Gene Information

Locus tag: MCA1076
Symbol:
ID: 3103278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1131896
End bp: 1132987
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 62%
IMG OID: 637170265
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_113551
Protein GI: 53804599
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.901696
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCAGCG TGTACAACAC CGACGATCTT CGCATCTGCG AGATCAAGGA AGTCATTCCG 
CCCGTCCAGG TTCATGAGGA ATTCCCGATC ACGGACCGGG CCGCACTCAC GACACTGACC
GCCCGCCGAG GGATTCACGC AATCCTTTCC AAGGAGGACG ACCGCCTGCT GGTGGTGATC
GGGCCCTGTT CGATCCATGA CCCCAAGGCC GCGCTCGAAT ACGGGGAGCG GCTGCTGCCA
CTCCGCCAGA AACTGGCGAG ACATCTGGAA ATCGTGATGC GGGTCTATTT CGAGAAGCCG
CGAACGACCG TCGGCTGGAA GGGCCTGATC AATGATCCCG ATCTGGACGA GAGTTTCAAC
ATCAACAAAG GCTTGCGCCT CGCCCGCAAG CTGTTGCTCG ATCTGAACGA ACTGGGCATG
CCCGCGGCCA CCGAGTACCT CGATCTCATC ACCCCGCAGT ATGTCTCCGA CCTGATCGCT
TGGGGCGCCA TCGGTGCTCG TACCACGGAG AGCCAGTCTC ACCGTGAACT GGCATCGGGG
CTGTCATGTC CGGTTGGATT CAAGAACGCC ACCGACGGCA CGATCAAGGT TGCTGTCGAC
GCCATAGGTG CGGCACGGCG GCCACATCAT TTCCTGTCTT TGACCAAGGC CGGTCATTCG
GCGATCTTCT CCACGACCGG TAACGCCGAC TGTCACATCA TCCTTCGTGG CGGAGCCCGG
CCGAATTACG ACGCGGCCAG CGTCGAAGCG GCGGCCAGGG CGCTGGAAGC CGTCGGCCTG
CCGCCCAACA TCATGGTGGA CTGCAGCCAT GCCAACAGCA TGAAGGATTA CCTGAAGCAG
CTGCGGGTGG CCGAGGACGT GGCCGAACAG ATAGACGGCG GCGACAGGCG GATCATCGGC
TTGATGGTGG AAAGTCACCT CAAGCCGGGC AATCAGAAAC TCCACAAGGG CATGGTTCCC
GAATACGGCG TCAGCATCAC CGATGCCTGC ATCGGCTGGG ATGACAGCGT GGCCGTGCTG
GAACGGCTCG CCGCCGCGGT GGAGAGCCGG CGCGGCCGGT CGGCAGGCAT CCGGAACGTG
CGGGGGGCCT GA
 
Protein sequence
MPSVYNTDDL RICEIKEVIP PVQVHEEFPI TDRAALTTLT ARRGIHAILS KEDDRLLVVI 
GPCSIHDPKA ALEYGERLLP LRQKLARHLE IVMRVYFEKP RTTVGWKGLI NDPDLDESFN
INKGLRLARK LLLDLNELGM PAATEYLDLI TPQYVSDLIA WGAIGARTTE SQSHRELASG
LSCPVGFKNA TDGTIKVAVD AIGAARRPHH FLSLTKAGHS AIFSTTGNAD CHIILRGGAR
PNYDAASVEA AARALEAVGL PPNIMVDCSH ANSMKDYLKQ LRVAEDVAEQ IDGGDRRIIG
LMVESHLKPG NQKLHKGMVP EYGVSITDAC IGWDDSVAVL ERLAAAVESR RGRSAGIRNV
RGA
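
Consistency check (illustrative sketch)

The record lists a gene length of 1092 bp, a protein length of 363 aa, and translation table 11. The sketch below shows one way these fields could be cross-checked against the sequences above. The function names (clean, translate, gc_content) are hypothetical helpers, not part of any database tool. It assumes the codon-to-amino-acid mapping of table 11 is the same as the standard table (the two differ in which start codons are permitted), which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Amino-acid assignments in NCBI order for bases T, C, A, G.
# Assumption: table 11 shares this mapping with the standard table;
# only the set of permitted start codons differs.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Strip the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGCCCAGCG TGTACAACAC CGACGATCTT CGCATCTGCG AGATCAAGGA AGTCATTCCG"
print(translate(first_line))   # MPSVYNTDDLRICEIKEVIP

# 1092 bp is 364 codons; one is the stop codon, leaving 363 residues.
print(1092 // 3 - 1)           # 363
```

Passing the full gene sequence to translate should reproduce the protein sequence listed above, and gc_content on the same input can be compared with the reported 62%.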