Gene MCA1769 details

Gene Information

Locus tag: MCA1769
Symbol:
ID: 3104231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1900555
End bp: 1901823
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 59%
IMG OID: 637170930
Product: cytochrome c peroxidase family protein
Protein accession: YP_114208
Protein GI: 53804187
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1858] Cytochrome c peroxidase
TIGRFAM ID:
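
The gene and protein lengths in this record can be cross-checked against the start and end coordinates. The sketch below is only an illustrative consistency check: it assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the gene record.
start_bp = 1900555
end_bp = 1901823
gene_length_bp = 1269
protein_length_aa = 422

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and does not contribute a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp == gene_length_bp)                          # 1269 True
print(expected_protein_aa, expected_protein_aa == protein_length_aa)  # 422 True
```

Under these assumptions the coordinates, the gene length, and the protein length agree with one another.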


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.391273
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGATTCA TGAAAATATC GAGTCACTTC GTGGTTTTCT TACTGGCTGC GGTTGCCGGC 
ACTCTGGAAG GACATGCGGC TACCGCACTG ACGCCTGGAG AACTGCTCGG CAAGGCGCTC
TTTTTCGACC CCAGCCTTTC GACCCCGCCA GGACAGTCCT GCGCCGACTG TCATGATCCC
AAGGCCGGAT GGACCGGCTC GGACCAGGAC ATCAATCTGC ACGGCGGTGT TTATGAAGGT
GCGGTTGCAA CACGCTTCGG TAACCGCAAA CCACCGACCG CCGCCTATGC TTCGTTCAGC
CCGAAATTTC ACCGCGACGG TAACGGGGAG TTCGTGGGCG GAAACTTCTG GGACGGCAGA
GCGACGGGGG AAAGGCTCGG AAATCCCGCG GCGGATCAGG CCCAGGGACC ATTCCTCAAC
CCTTTGGAGC AGAATGATCC CAGTGCGGCG GACGTTTGCC GGAAGGTGGC GGCTTCCGGT
TTCGCCGCCC AGCTCACGGG TTCAAGCTAT CCCGATCTGT TCGCGCGCGC CTTCGGGCCG
GGCACGTTGG ACTGTGATAA TTCATCCGAT ACTTATGACC GCATCGCTCT GGCCATTGCC
GCCTATGAGG CTTCCAGGGA GGTCAGTTCA TTCAGCTCGA AGTACGATGC ATATCTGAGA
GGGAGGGCGG TGTTGACGAA ACAGGAGAAG AAGGGGATGG CATTGTTTGA GGGAAAAGCG
AAGTGCGCGA ATTGCCATTC CACGAGGGGC ATGAGCTACG CCGGCAAATT TCCCCTTTTC
ACCGATTTCA CCTATGTCAA CACCGGTGTT CCCAGAAACC CGGAAAACCC TTTCTATCAG
ATGCCCGCCG AGTTCAATCC GCTAGGCGCG GACTGGGTGG ACCCGGGACT GGGCGGATTC
CTGGCTGGCC GGGTGGAGTA CGCACCGTAC GCGGCCGATA ACAAAGGTAA GCAGAAAGTT
CCCACCTTGC GGAATGTCGA CAAGCGCCCA TCCTTGGCAT ACCTGAAGGC ATACATGCAC
AACGGCGCAT TCAAGAGTCT GAAGGAAGTG GTTCACTTCT ATAACACGCG CGATGTGCTG
GCCGCCTGTG AGCACCTTTC CCATCCGGAG CCCGGCATCA ACTGCTGGCC GGCTGCGGAA
GAGGCAGCCA ATGTCAACCG GACGGAAACG GGCGATTTGA AATTGTCGGA TGAGGAGGAG
GATGCCATCG TCGCGTTTCT GAGGACGCTG TCCGACGGCT TCCAGCTTTC AGGCCCAACG
GCCGACTGA
 
Protein sequence
MRFMKISSHF VVFLLAAVAG TLEGHAATAL TPGELLGKAL FFDPSLSTPP GQSCADCHDP 
KAGWTGSDQD INLHGGVYEG AVATRFGNRK PPTAAYASFS PKFHRDGNGE FVGGNFWDGR
ATGERLGNPA ADQAQGPFLN PLEQNDPSAA DVCRKVAASG FAAQLTGSSY PDLFARAFGP
GTLDCDNSSD TYDRIALAIA AYEASREVSS FSSKYDAYLR GRAVLTKQEK KGMALFEGKA
KCANCHSTRG MSYAGKFPLF TDFTYVNTGV PRNPENPFYQ MPAEFNPLGA DWVDPGLGGF
LAGRVEYAPY AADNKGKQKV PTLRNVDKRP SLAYLKAYMH NGAFKSLKEV VHFYNTRDVL
AACEHLSHPE PGINCWPAAE EAANVNRTET GDLKLSDEEE DAIVAFLRTL SDGFQLSGPT
AD
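
The relationship between the gene sequence and the protein sequence, and the GC content figure, can be reproduced with a few lines of code. The following is a minimal sketch, not the method used to build this record. It assumes the sequence is pasted in with the 10-base spacing shown here, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which is not a factor here because the gene begins with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Amino acid assignments are identical for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
first_block = "ATGAGATTCA TGAAAATATC GAGTCACTTC"
last_block = "GCCGACTGA"

print(translate(first_block))  # MRFMKISSHF
print(translate(last_block))   # AD
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would give the complete protein and the GC percentage for comparison with the record.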