Gene MCA0654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMCA0654 
Symbol 
ID3104680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylococcus capsulatus str. Bath 
KingdomBacteria 
Replicon accessionNC_002977 
Strand
Start bp686941 
End bp687870 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content62% 
IMG OID637169865 
ProductCRISPR-associated Csh2 family protein 
Protein accessionYP_113167 
Protein GI53804985 
COG category[L] Replication, recombination and repair 
COG ID[COG3649] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR01595] CRISPR-associated protein, CT1132 family
[TIGR02589] CRISPR-associated protein, Csd2 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCAGC TCCCCCAAAA TCGCTACGAC TTCGTGCTGT TGTTCGAGGT CAAAGACGGA 
AATCCCAACG GCGATCCCGA TGCAGGAAAC CTGCCGCGTT TGGATGCGGA AACGGGTCAC
GGGTTGGTCA CCGACGTCTG CCTGAAGCGG AAAATCCGCA ATTTCGTCGG CCTGACGCAA
GGCGATGCCG CCCCTTACGA AATCTATGTC AAGGAAAAAG CCGTTCTGAA TCGGCAACAC
GAGCGAGCCT ATCAGGCATT GGGCGTGGAT TTAGGTGCCG ATGAGGGGAA GCGTAAAGGC
GGCGATAAGG TCGATGATGC CCGCCGCTGG ATGTGCCAGA ACTTCTTCGA CGTCCGCACC
TTCGGCGCGG TGATGTCGAC CGGCGTCAAC TGTGGTCAAG TCCGAGGGCC CGTGCAACTC
ACCTTCGCGC GTTCCATCAG CCCCATCGTT GCCCTGGAAC ACTCCATTAC CCGCATGGCG
GTTGCCACTG AGGCGGAAGC GGAAAAGCAG GGCGGCGACA ACCGCACCAT GGGCCGCAAG
CACACCGTGC CCTACGGTCT TTACCGCGCC CATGGCTTCG TGTCGGCCCA TCTCGCCCAA
CAGACCGGTT TTTCCGAAAA GGATCTCGAA TTGCTCTGGC AGGCGTTGAG CCAGATGTTC
GACCACGATC ACTCCGCGGC CCGCGGCGAA ATGGCCACGC GGGGGCTCTA CGTCTTCAAG
CACGTCGGCA CCGATACCGA CCCGGACCAA CGCAAGCAGC AGGCCATGCT CGGTTGCGCG
CCGGCGCACA AGCTGTTCGA TCTGATCCGA GTGGAACCCA AAGACACCGG CCGGCCGCCG
CGCGAGTTTG GGGACTACGC GGTCAGCGCG CCGCCCGCCG GGCCGTTGCC GGCGTTTCCC
GGCGTGGAAC TGATGATCCT CGTGCCATGA
 
Protein sequence
MNQLPQNRYD FVLLFEVKDG NPNGDPDAGN LPRLDAETGH GLVTDVCLKR KIRNFVGLTQ 
GDAAPYEIYV KEKAVLNRQH ERAYQALGVD LGADEGKRKG GDKVDDARRW MCQNFFDVRT
FGAVMSTGVN CGQVRGPVQL TFARSISPIV ALEHSITRMA VATEAEAEKQ GGDNRTMGRK
HTVPYGLYRA HGFVSAHLAQ QTGFSEKDLE LLWQALSQMF DHDHSAARGE MATRGLYVFK
HVGTDTDPDQ RKQQAMLGCA PAHKLFDLIR VEPKDTGRPP REFGDYAVSA PPAGPLPAFP
GVELMILVP