Gene Synpcc7942_1945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1945 
SymboluvrC 
ID3775308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2019498 
End bp2021429 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content54% 
IMG OID637800387 
Productexcinuclease ABC subunit C 
Protein accessionYP_400962 
Protein GI81300754 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.892148 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGCTG CGACGCCAAT GCCACTGCTG AAGCAACCCG ATCGCCTGGA AGCGCGATTG 
CGGGAGCTAC CGGCTGAGCC CGGCGTTTAT TTCATGCGTG ATGCCAGCGA TCGCATTCTC
TACATCGGCA AAAGCAAAAA GCTGCGATCG CGGGTGCGTT CCTATTTCCG TGACCTCGAG
CGGCTTAACC CCCGCATCAA TTTGATGGTG CGGCAAGTTT GTGAAATCGA AATTATTGTT
ACCGATACAG AAGCTGAGGC ACTTGCTCTC GAAGCAAACC TGATTAAACA GCATCAGCCC
CATTTCAATG TCCTGCTCAA GGACGACAAG AAATATCCCT ATCTCTGCAT CACTTGGAGC
GATGACTATC CTCGTATTTT TATCACTCGC AAACGGCGAC TAGGGAATAG TCGCGATCGC
TACTACGGCC CCTACGTTGA TACGCGCTTA CTGCGCCACA CACTCTTTTT AGTCAAGCGT
TTGTTTCCGC TGCGGCAGCG ACCCCAACCG CTGTTCAAAG ATCGCACTTG CCTGAATTAC
GACATTGGTC GCTGCCCAGG GGTCTGTCAG TCGTTAATCC GACCGGATGA CTATCGCAAG
ACACTGCAGA AAGTCGCGAT GATCTTCCAA GGGCGCAGCA GCGAATTAGT GGAGCTGCTC
GAAGCGCAAA TGCTGCAGGC AGCCGAGAAT TTAGAGTTTG AAAAAGCAGC GAAAATTCGT
GATCAAATTC GTGGCCTGGA AGGCTTAGGG GCGGAGCAGA AAGTGCAGCT CCCGGATGAC
CGAATCTCGC GAGATGCGAT CGCGCTGGCA ATGGATGAGC AGCATGCTTG CATCCAACTC
TTTCAGATTC GGGCGGGTAA GTTAGTCGGT CGACTAGGGT TTGTGGCGGA TGCTCAATCG
GGTAGTGCTG CTGCGATCGC GCAACGGGTG TTAGAAGAGC ATTACGCCAG CGTTGATTCG
GTGGAAATTC CTCAAGAAGT CCTCGTTCAG CACGACCTCC CTGAAGCAGA ATTGCTAGAA
GTTTGGCTCT CGGAGCGGCG GGGCCGCAAA GTCGAGATCC TGTCTCCACA ACGGCAGATC
AAAGCTGACC TCATCGCCAT GGTCGAGCGC AATGCAGAGT ACGAACTGGC CAGAACTCAG
CAGTCGGCAG AACGCCATAC TGCCTCACTG ATTGATCTGG CGGATCTGTT GGATTTGCCC
GAGTTACCTC GACGAATTGA AGGCTACGAT ATTTCCCACA TTCAAGGGTC GGATGCGGTG
GCTTCGCAGG TGGTCTTCAT TGATGGCTTA CCCGCCAAGC AACACTATCG CCGCTACAAG
ATTCGCAACC CTGAAGTTCG CGCCGGTCAT TCCGATGACT TCGCCAGTTT AGCGGAGGTG
CTCCACCGGC GTTTCCGCAA GTTTGCGGAG GCGAAGGCCC GAGGTGAATC CTTGGCACCC
AGTGAACAAC GACAAGGTAG TTTATTGCGA CCGGATGACC TCGCAGATTT CCCCGATCTG
GTGATGATTG ATGGCGGTAA AGGACAACTC TCAGCTGTTG TCGAAGTTCT GCGCAATCTG
AATCTGCTGG AGGATGTCAA GCTCTGCAGC TTGGCTAAGA AGCGTGAGGA AATTTTCTTG
CCAGGAGCTT CCGATCCGCT GCCAACGGAT GCCGAACAAC CGGGCGTCCA ATTACTCCGG
CGGCTGCGCG ATGAAGCCCA CCGCTTTGCG GTTAGTTTTC ATCGCCAGAA ACGAACGGAA
CGGATGCGGC GATCGCGCTT GGATGACATT CCGGGGTTGG GCCACAAGCG CCAGAAAGAG
CTGCTGGCTC ACTTCCGTTC AATTGACTAT TTGCGATTGG CAACGCCTGA ACAAATTGCC
GAAGTGCCCG GTATTGGCGC AGTTCTTGCC CAGCAAATTT GGGGCTATTT CCATCCCAGC
GAAACAGCTT GA
 
Protein sequence
MIAATPMPLL KQPDRLEARL RELPAEPGVY FMRDASDRIL YIGKSKKLRS RVRSYFRDLE 
RLNPRINLMV RQVCEIEIIV TDTEAEALAL EANLIKQHQP HFNVLLKDDK KYPYLCITWS
DDYPRIFITR KRRLGNSRDR YYGPYVDTRL LRHTLFLVKR LFPLRQRPQP LFKDRTCLNY
DIGRCPGVCQ SLIRPDDYRK TLQKVAMIFQ GRSSELVELL EAQMLQAAEN LEFEKAAKIR
DQIRGLEGLG AEQKVQLPDD RISRDAIALA MDEQHACIQL FQIRAGKLVG RLGFVADAQS
GSAAAIAQRV LEEHYASVDS VEIPQEVLVQ HDLPEAELLE VWLSERRGRK VEILSPQRQI
KADLIAMVER NAEYELARTQ QSAERHTASL IDLADLLDLP ELPRRIEGYD ISHIQGSDAV
ASQVVFIDGL PAKQHYRRYK IRNPEVRAGH SDDFASLAEV LHRRFRKFAE AKARGESLAP
SEQRQGSLLR PDDLADFPDL VMIDGGKGQL SAVVEVLRNL NLLEDVKLCS LAKKREEIFL
PGASDPLPTD AEQPGVQLLR RLRDEAHRFA VSFHRQKRTE RMRRSRLDDI PGLGHKRQKE
LLAHFRSIDY LRLATPEQIA EVPGIGAVLA QQIWGYFHPS ETA