Gene MCA1273 details


Gene Information

Locus tag: MCA1273
Symbol:
ID: 3103507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1356317
End bp: 1357633
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 67%
IMG OID: 637170452
Product: N-ethylammeline chlorohydrolase
Protein accession: YP_113736
Protein GI: 53804375
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
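
The coordinates and lengths listed above agree with one another, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1356317
end_bp = 1357633
gene_length_bp = 1317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp == gene_length_bp)           # True
print(remainder == 0)                      # True
print(coding_codons == protein_length_aa)  # True
```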


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTATCG ACACTTTGAT CACCGCGCGC TGGATCATCC CGGTCGAGCC CGACGGGGTG 
ACGCTCGAAC ACCATGCCCT GGCCATCGAC CGCGGCCGCA TCACCGACCT CCTCCCCACC
ACCGAAGCGC TGGTCAGATA CCAGCCGCGG CGGATCGAAC GGCTGGAGCA CCACGCCCTG
ATCCCCGGCC TGGTCAACGC CCACACCCAT GCCGCCATGA CGCTGCTGCG CGGCGTCGCC
GACGACCTGC CGCTGATGCA ATGGCTGCAG GAACACATCT GGCCGCTGGA GCAGAAATGG
ATCGGCGAAG CGTTCGTCCG CGACGGCGTG CAGCTGGCCA TGGCGGAAAT GATCCGGGGC
GGCGTCACCT GCTTCAACGA CATGTACTTC TTTCCCGAAG TGGTGGCGCG CGAAGCGGTG
CGGGCCGGCA TGCGGGCGGC GGTGGGCATG ATCGTGGTGG ACTTCCCCAC CGCCTGGGCC
ACCGACGCGG ATGACTATCT CCGCAAAGGC CTGGCCCTCC GCGACGACTA CCGCCACGAA
CCGCTGATCG CCACGGTATT CGCACCGCAC GCACCCTACA CCGTGAGCGA CGAGCCGCTG
GTCCGCATCC GCACCTGGTC GGAAGAGCTG GACTGCCCCG TGCACATCCA TCTCCACGAG
ACCGCCGACG AAATCCACCG GAGCGGACGG CAGTACGGCA TGCGTCCGCT CAAACGCCTG
GACCAGCTCG GCCTGGTCGG ACCGCACCTG ATCGGCGTTC ACATGACACA ACTGGAAGAC
GGCGAGATCG CACGCCTGGC CGAAACCGGC GCCAGCGTAG TGCACTGCCC CGAATCCAAC
CTGAAGCTGG CCAGCGGCTT CTGCCCCGCC GTCAAGCTGC TGGCGGCGGG CGTCAACGTC
GCGCTCGGCA CCGATGGCGC GGCCAGCAAC AACGACCTGG ACCTGCTCGG TGAAACCCGC
ACCGCGGCCC TGCTGGCCAA GGCGGTAGCC AACGACGCCG CCGCCCTCCC CGCCCACCAG
GCGCTGCGGA TGGCGACCCT GAACGGCGCG GCGGCCTTGG GACTGGGAGC GGAAACCGGC
TCGCTTGTAG TCGGCAAATC CGCCGACGTG GTCGCCATCG GGCTGGAGCA CATCGAATCG
CTGCCGATCT ACAACCCGGT GTCCGACCTG GTCTACGCGG CTGGCCGCCA GCAGGTCACC
GACGTCTGGG TGGCAGGGCG TCAACTGCTG AAAAAGCGCG AGCTGCTGAC GCTGGATGCC
ACGGAAATCC GCGAAAAGAC CCTGATCTGG CGCGACAAAC TGATCCATCA TTCCTGA
 
Protein sequence
MIIDTLITAR WIIPVEPDGV TLEHHALAID RGRITDLLPT TEALVRYQPR RIERLEHHAL 
IPGLVNAHTH AAMTLLRGVA DDLPLMQWLQ EHIWPLEQKW IGEAFVRDGV QLAMAEMIRG
GVTCFNDMYF FPEVVAREAV RAGMRAAVGM IVVDFPTAWA TDADDYLRKG LALRDDYRHE
PLIATVFAPH APYTVSDEPL VRIRTWSEEL DCPVHIHLHE TADEIHRSGR QYGMRPLKRL
DQLGLVGPHL IGVHMTQLED GEIARLAETG ASVVHCPESN LKLASGFCPA VKLLAAGVNV
ALGTDGAASN NDLDLLGETR TAALLAKAVA NDAAALPAHQ ALRMATLNGA AALGLGAETG
SLVVGKSADV VAIGLEHIES LPIYNPVSDL VYAAGRQQVT DVWVAGRQLL KKRELLTLDA
TEIREKTLIW RDKLIHHS
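
One way to check that the protein sequence corresponds to the gene sequence is to translate the nucleotides codon by codon. The following is a minimal sketch, not the database's own method. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares for codons after the start. Only the first and last few codons of the listing are used as input.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
# Assumption: this suffices for table 11, which differs from the standard
# code in its permitted start codons rather than in these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the listing."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 15 and last 18 bases of the gene sequence listed above.
first_codons = "ATGATTATCG ACACT"
last_codons = "C TGATCCATCA TTCCTGA"

print(translate(first_codons))  # MIIDT
print(translate(last_codons))   # LIHHS
```

Passing the complete gene sequence to `translate` and `gc_percent` would be expected to reproduce the 438-residue protein and a GC content near the listed 67%, provided the listing is internally consistent.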