Gene Synpcc7942_1060 details


Gene Information

Locus tag: Synpcc7942_1060
Symbol:
ID: 3773990
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1070025
End bp: 1072052
Gene Length: 2028 bp
Protein Length: 675 aa
Translation table: 11
GC content: 54%
IMG OID: 637799482
Product: type I restriction-modification
Protein accession: YP_400077
Protein GI: 81299869
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID:
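
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the gene length counts the stop codon; neither is stated on this page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1070025
end_bp = 1072052
gene_length_bp = 2028
protein_length_aa = 675

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # do the coordinates match the gene length?
print(gene_length_bp % 3 == 0)        # is the length a whole number of codons?
print(residues == protein_length_aa)  # do codons minus the stop match the protein?
```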


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000030221
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0378969
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTTTG AATCCTCCCA ATCAACGACC GAAATCAGCA GTCTTCAAGC TGAGTTCCTC 
GGGGTGCTTC AAGCCCTCGG TGGCTCAGCG GGGAATAGCA AGCTGCAACA GTTGTTGGCG
TGGGATGATG CTCGCTACGA GTCGGTCAAA GCAGAACTGA TGCTGCAAGG CCGGATTCAA
TCAGGACGGG GACGCGGCGG GTCTGTGACG TTGATGGATA GTCAGGGAAT TGCGATCGCT
ACGCCACCAG ACCTCCCAAT GGCCGCTCCT CTAGAACCGC AACCGACTGG ACGCCAGAAT
CTCTCGGCCT TCATTTGGAG TGTGGCTGAC CTGCTGCGCG GCGACTACAA ACAGAGTGAC
TACGGCAAAA TCATCCTGCC GTTTACGGTG CTGCGGCGCT TGGACTGTGT GCTTGCCCCG
ACGAAAGCCG CTGTTCTTGA AGAGAAAGTC CTACGCGAAA GCCAAGGCTT AGCTCCAGAA
CCGTTCTTGC TGAAGAAAGC AGGGCAGAAC TTCTGCAATA CCTCACCGCT CGATCTCAAG
CAGCTGATGG GTGATGCCGA CAACATTGGC GAGAACCTGC GTGCTTATAT CCAAGGCTTC
ACGCCAGCAG TGCGGGACAT TTTCGATAGC TTTGAGTTTC ATCTACAAAT CGATCGCCTC
GAAAAAGCGG GTCTGCTTTA CCTTGTCACT GAACGCTTTG CGCAGATCGA CCTCCATCCC
GATACGGTTA GCAATGCCGA GATGGGTTTG GTATTTGAGG AGCTGATTCG CAAATTTGCC
GAACTCTCGA ATGAAACGGC TGGGGAACAC TTCACGCCCC GCGAAGTGAT TCGGCTGATG
GTCAATTTGC TCTTTATTGA GGATGATGCC GCGCTGACTC AGCCTGGAAT TGTCCGCAGT
CTGTATGACC CCACTGCAGG CACGGGCGGC ATGCTCAGTG TGGCCGAGGA ACATCTAACG
GAGCTGAATC CTTCGGCGCG ATTGGTGCTG TCGGGGCAGG AATTGAACCC AGAGTCCTAT
GCGATCTGCA AAGCCGACAT GCTGATCAAG GGACAAAACA TCCAGAATAT TTGCTTTGGC
AACACGCTCT CCGATGACAA GTTGCCCGAT GCCAAGTACG ACTACATGCT GTCGAATCCT
CCCTTCGGCG TCGAGTGGAA AAAGATTCAG AAGGAAGTCC AGCGAGAAGC TGAACAGTTG
GGCTACAGCG GTCGTTTTGG TCCTGGCTTA CCTCGCGTCA GTGATGGCTC GTTGCTCTTT
CTCCTGCATC TGATCTCGAA GATGCGACCT GCTAGTGAAG GCGGCAGCCG TCTGGGGATT
GTGCTGAATG GATCACCGTT ATTTACCGGC GGGGCTGGGT CTGGCGAGAG CGAGATTCGC
CGCTATGTCC TTGAAAACGA TCTGGTCGAG GCGATTATCG CTTTGCCCAC GGACATGTTC
TACAACACAG GTATCAGCAC CTATATCTGG ATTCTGAGTA ACCGGAAGCC TGCAAGCCGC
AAAGGCAAAG TTCAGCTAAT TGATGCCAGT GGTTTTTGGC AGAAGATGCG CAAGAGTTTG
GGCAGTAAGC GCAAGGAACT GAGCGAGGAG CAGATTGCGG AGATTACGCG GTTGTTCGGC
AACTTTGAGG AAGCCGATCG CGATGGGAAA CCCGTTAGCA AAATCTTTCG CAATGAAGAG
TTTGGCTATC GCACGATTAC AGTGGAGCGT CCGCAGCGAG ATGAGGCGGG GAATGTCGTA
CTGGCGCAGC GGGGCAAGAC TAAGGGGCAG CCTGTGGCGG ATGCCAGCTT ACGGGATACC
GAGAATGTGC CGCTAACTGA GGATGTAGAC ACCTATTTCC AGCGAGAAGT GTTGCCGCAT
GTGCCGGATG CTTGGATCGA CCCAGAGAAA ACCAAAGTCG GTTACGAGAT TCCCTTTAAC
CGGCATTTTT ATGTGTTTAC GCCACCGCGA TCGCTGGAGG AAATTGATGC GGAGTTACAG
CAAGTCACCG ATCGCATTCT GACAATGCTC GGGGGCTTAT CTCACTAA
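
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The following is a minimal sketch, assuming the sequence has been pasted into a Python string. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first display line is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence above (60 of 2028 bases).
first_line = "ATGACGTTTG AATCCTCCCA ATCAACGACC GAAATCAGCA GTCTTCAAGC TGAGTTCCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Applying `gc_percent` to the full cleaned sequence should give a value comparable to the GC content reported under Gene Information.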
 
Protein sequence
MTFESSQSTT EISSLQAEFL GVLQALGGSA GNSKLQQLLA WDDARYESVK AELMLQGRIQ 
SGRGRGGSVT LMDSQGIAIA TPPDLPMAAP LEPQPTGRQN LSAFIWSVAD LLRGDYKQSD
YGKIILPFTV LRRLDCVLAP TKAAVLEEKV LRESQGLAPE PFLLKKAGQN FCNTSPLDLK
QLMGDADNIG ENLRAYIQGF TPAVRDIFDS FEFHLQIDRL EKAGLLYLVT ERFAQIDLHP
DTVSNAEMGL VFEELIRKFA ELSNETAGEH FTPREVIRLM VNLLFIEDDA ALTQPGIVRS
LYDPTAGTGG MLSVAEEHLT ELNPSARLVL SGQELNPESY AICKADMLIK GQNIQNICFG
NTLSDDKLPD AKYDYMLSNP PFGVEWKKIQ KEVQREAEQL GYSGRFGPGL PRVSDGSLLF
LLHLISKMRP ASEGGSRLGI VLNGSPLFTG GAGSGESEIR RYVLENDLVE AIIALPTDMF
YNTGISTYIW ILSNRKPASR KGKVQLIDAS GFWQKMRKSL GSKRKELSEE QIAEITRLFG
NFEEADRDGK PVSKIFRNEE FGYRTITVER PQRDEAGNVV LAQRGKTKGQ PVADASLRDT
ENVPLTEDVD TYFQREVLPH VPDAWIDPEK TKVGYEIPFN RHFYVFTPPR SLEEIDAELQ
QVTDRILTML GGLSH
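
The protein sequence can be related back to the gene sequence through translation table 11. The following is a minimal sketch, assuming the NCBI codon ordering (bases in the order T, C, A, G at each position) and that table 11 assigns the same amino acids as the standard code, differing only in which start codons are permitted. The function name `translate` is hypothetical.

```python
from itertools import product

# Amino acids for the 64 codons in NCBI order (T, C, A, G at each position).
# Assumption: these assignments apply to translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last nine bases of the gene sequence above.
print(translate("ATGACGTTTG AATCCTCCCA ATCAACGACC"))  # MTFESSQSTT
print(translate("TCTCACTAA"))                         # SH
```

The two fragments reproduce the first ten residues and the last two residues of the protein sequence shown above.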