Gene Synpcc7942_1863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1863 
Symbol 
ID3775226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1933701 
End bp1934582 
Gene Length882 bp 
Protein Length293 aa 
Translation table11 
GC content59% 
IMG OID637800304 
ProductHemK family modification methylase 
Protein accessionYP_400880 
Protein GI81300672 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACCA CTACCTGGCA GGCTGTGCTC ACTTGGCGAT CGCACCAGCA GCAACTGGCG 
CCGGATATCG ATCGCCAAGA ATTGGACTGG CTCCTCCGGG AAGTGGCCGG CGTTCCGCTA
GAACGTCAAC GCTGGGCAGC CCCAGGCGAT CGCCTTGAGC TACGTTGCCC ACTAGCAGCG
ATCGCGGATC TCTGGCAACA ACGGATCCGA CAGCGCTGTC CGGTGCAGTA TCTGGCAGGT
CATGCGCCTT GGCGCGACTT GGAGTTGCAG GTTTCCCCCG CTGTCCTGAT TCCCAGGCCA
GAAACAGAGC TGATCATCGA CCTAGCAATC GCTTGGTCCC AAGCAGAACC AGCCCGACAA
ACAGGCTTCT GGGCGGATTT AGGGACTGGC AGCGGTGCGA TCGCGATCGC GTTAGCGCGG
GCACTACCCC AAATCACCGT CCTTGCCGTC GATGTCAGTG CTGAGGCTCT GGCGATCGCC
CGGAACAATG CAGCCCGCTA TGGTTTAAGC GATCGCATCC GCTGGTATCA GGGCAGTTGG
TTGGTGCCTT TGGCCGACTA TCGAGGTCAA CTGCAGGCAA TTATCTCCAA TCCGCCCTAC
ATTCCCACTC AAGAGTGGCA AGCCCTAGAG CCGGAAGTCC GCGATCATGA ACCGCGTCAA
GCTCTGGAGT CTGGCCCTGA TGGGTTAGAA GCGCTACGCC ATTTAGCCCA AGCGGCGCCT
GACTATTTGC GATCGCTCGG TCTGTGGCTC TGCGAACACA TGGCCGGTCA AAGTACCGCT
GTAACGGCTT TGCTGGCGGC CATTCCTGGC TATTCTGAGA TCCAAAGTCA TCGCGATTTA
GCGGGCCGCG ATCGCTTTGT TTCGGCCAGT TGGAGTGCTT GA
 
Protein sequence
MATTTWQAVL TWRSHQQQLA PDIDRQELDW LLREVAGVPL ERQRWAAPGD RLELRCPLAA 
IADLWQQRIR QRCPVQYLAG HAPWRDLELQ VSPAVLIPRP ETELIIDLAI AWSQAEPARQ
TGFWADLGTG SGAIAIALAR ALPQITVLAV DVSAEALAIA RNNAARYGLS DRIRWYQGSW
LVPLADYRGQ LQAIISNPPY IPTQEWQALE PEVRDHEPRQ ALESGPDGLE ALRHLAQAAP
DYLRSLGLWL CEHMAGQSTA VTALLAAIPG YSEIQSHRDL AGRDRFVSAS WSA