Gene Synpcc7942_0848 details


Gene Information

Locus tag: Synpcc7942_0848
Symbol: uvrA
ID: 3774025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 842336
End bp: 845347
Gene Length: 3012 bp
Protein Length: 1003 aa
Translation table: 11
GC content: 57%
IMG OID: 637799264
Product: excinuclease ABC subunit A
Protein accession: YP_399866
Protein GI: 81299658
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
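
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 842336
end_bp = 845347

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3012
print(protein_length_aa)  # 1003
```

Under these assumptions the computed values agree with the Gene Length (3012 bp) and Protein Length (1003 aa) fields.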


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.23949
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAAGC GCGCTTCTGT TGAGCCTTCA TCTGATCTGG CTCATCGTAA TGGCGATCGT 
CCAGCAGCGG AGACGGATCA AATTCGGGTA CGGGGAGCCC GCCAGCATAA TCTCCAAAAC
CTCGATCTCG ACTTACCCCG TAACCAGCTG ATCGTATTCA CGGGGGTGAG TGGCTCTGGC
AAGTCGTCCT TGGCCTTCGA CACGATTTTT GCCGAGGGAC AACGCCGCTA CGTTGAGTCC
CTATCGGCAT ACGCCCGTCA GTTCCTCGGT CAAGTCGATA AGCCCGATGT TGACGCGATC
GAGGGTTTGA GTCCTGCTAT TTCAATCGAC CAAAAGTCGA CTTCTCACAA TCCCCGCTCG
ACGGTGGGCA CCGTCACCGA AATCTACGAT TATCTGCGCT TGCTCTTTGG GCGAGCGGGT
GCTCCCCACT GCCCTCACTG CGATCGCTCG ATCGCCCCGC AAACCATTGA TCAAATGGTC
GATCGCCTGG CAGAACTGCC TGATCGCAGC CGCTTTCAAA TTCTGGCGCC GGTGGTACGC
GGCAAAAAAG GAACCCATAA AAAGTTGCTT TCCAGTCTGG CAGCAGAAGG CTTTGTACGG
GTGCGGATCA ACGGCGAAGT GCGAGAACTG AGCGACAGCA TTGAGCTGGA TAAAAATCAT
GCTCACACAA TTGAAATTGT GGTCGATCGC CTCGTTCAAA AGGCCGGCAT CCAAGAGCGA
CTGGTGGATT CCCTGACGAC CTGTTTGCGC CGATCTGAAG GAATTGCGGT GATCGATGTC
CTGCCGGGCG ATCGCGGTAA CGTCATCACC CTGCCGGTTG AGCACCATTC CAGCCGAGCC
GCTGATACGG GCGCTGAATA CAATCGAGAG CCCCAGGGCT ATGAACTGGT GTTCTCCGAA
AACTTTGCCT GCCCAGAGCA TGGCGCGGTC ATGGAGGAGC TATCGCCACG CCTCTTTTCC
TTTAACTCGC CCTACGGGGC CTGCTCCCAT TGCCATGGCT TGGGCACCCT GCGCACCTTT
TCACCAGAGT TGGTGGTTCC TGATCCCAAC CTGCCGGTCT ACGCGGCGAT CGCGCCTTGG
TCCGAGAAAG ATCACAGCTA CTACTTCAAT CTCCTCTGTG GTGTGGCTGA AGTCAGTGGC
TTTGAAATCA GCGATCGCTG GCAGGATCTG ACGCCTGATC AGCAGCATGC GATTCTCTTC
GGCAGTAGTG AGCCGATCGC CGTCGAAACC GACTCGCGCT TCCGCGATCG CCATCTCTAC
CACCGCCCCT TCGAAGGTGT CATCCCCATC TTGGAGCGGC AGTATCGAGA GACCAGCTCG
GAAGTCTACA AACAAAAGCT CGAGAAATAT TTAGTCGACC AAACCTGCGA AGTCTGCCAC
GGCCTGCGCC TCAAGCCCGA ATCGTTGGCG GTACGCATGG GACCCTACCG CATCACCGAC
CTAACCAGTG TCTCGGTGCG GGAATGTCTG AACAAAATTG AGGCCCTGAT CGGCAGCGAC
AATCGCGAAC CGCTGTTGAC TGCCCGCCAA ATGCAGATTG GCGATCTGGT ACTGCGGGAA
ATTCAAGCGC GCCTGCGCTT CCTGCTCGAT GTTGGCTTGG ACTATCTCAC CCTCGATCGC
CCAGCGATGA CCCTCTCCGG CGGTGAAGCC CAGCGGATTC GCTTGGCGAC CCAAATTGGC
GCGGGGCTGA CTGGCGTGCT CTACGTTTTG GATGAACCGA GTATTGGTCT CCACCAACGC
GATAACGATC GCCTGCTCAA CACCTTGATT CGCCTGCGCG ATCTCGGTAA CACCCTAATC
GTGGTCGAGC ACGATGAAGA TACGATTCGC GCTGCTGATC ACCTCGTGGA TATTGGCCCC
GGTGCTGGTG TTCACGGCGG CCGCATTGTG GCTCAAGGGA ACTTGAACCA GTTATTGACG
GCTGAAGAGT CTCTCACAGG GGCCTATCTC TCGGGTCGGC GGCGCATTGC TACGCCTGAC
AGCCGTCGCG CTGGGAATGG CCGTCGTCTG CTTCTCCAAG GTGCGAACCG CAACAACCTC
CAAAATCTGG ATGTAGAAAT CCCCCTCGGC AAGCTCGTCT GTATCACCGG TGTCAGCGGG
TCGGGTAAGT CGACGCTGAT CAACGAACTC TTGCATCCGG CCTTGGAACA TAGTTTGGGG
CTCAAGGTCC CGTTTCCCAA AGGCCTGGCA GTGCTGAAAG GGACGAAAGC AATCGACAAA
GTGATTGTGA TCGATCAGTC ACCCATTGGT CGTACACCGC GATCGAATCC TGCCACCTAT
ACGGGGGCAT TTGACCCCAT CCGGCAGGTA TTTGCAGCGA CCACGGAAGC CAAAGCCCGT
GGCTATCAAC CTGGGCAATT CTCGTTCAAC GTCAAAGGCG GCCGCTGCGA AGCCTGTGGC
GGGCAGGGCG TCAACGTGAT TGAGATGAAC TTCCTGCCCG ATGTCTACGT CCAGTGTGAT
GTCTGCAAAG GCGCTCGCTA TAACCGCGAG ACGTTACAGG TCAAGTACAA GGGCAAAACG
ATCGCGGACG TCCTGAACAT GACCGTTGAA GAAGCGGCAC AGTTCTTCGA GAACATTCCC
CAAGCCAGTA GCCGTCTCCA AACCTTGGTT GATGTCGGCC TCGGCTATGT TCGCCTAGGG
CAACCGGCAC CCACCCTCTC AGGCGGTGAG GCTCAGCGTG TCAAGCTAGC GACGGAACTA
GCCCGACGAG CAACCGGTAA GACGCTCTAT CTCATTGATG AGCCGACGAC TGGTCTCTCT
TTCTACGATG TCCACAAGCT GCTAGACGTA ATGCAGCGTT TAGTCGATAA AGGCAATTCG
ATTGTGGTGA TTGAACACAA CTTAGATGTG ATCCGTTGTG CTGACTGGAT TATCGACCTC
GGGCCTGAAG GCGGCGATCG CGGCGGCGAA ATCGTTGTTT GCGGCGAGCC AGAGATCGTA
GCAACCCACA ATCAATCTCA CACAGGCCGC TATCTGGCCC AAGTCCTTGC CCAGCATCCG
CCCGGTCAGT AG
 
Protein sequence
MPKRASVEPS SDLAHRNGDR PAAETDQIRV RGARQHNLQN LDLDLPRNQL IVFTGVSGSG 
KSSLAFDTIF AEGQRRYVES LSAYARQFLG QVDKPDVDAI EGLSPAISID QKSTSHNPRS
TVGTVTEIYD YLRLLFGRAG APHCPHCDRS IAPQTIDQMV DRLAELPDRS RFQILAPVVR
GKKGTHKKLL SSLAAEGFVR VRINGEVREL SDSIELDKNH AHTIEIVVDR LVQKAGIQER
LVDSLTTCLR RSEGIAVIDV LPGDRGNVIT LPVEHHSSRA ADTGAEYNRE PQGYELVFSE
NFACPEHGAV MEELSPRLFS FNSPYGACSH CHGLGTLRTF SPELVVPDPN LPVYAAIAPW
SEKDHSYYFN LLCGVAEVSG FEISDRWQDL TPDQQHAILF GSSEPIAVET DSRFRDRHLY
HRPFEGVIPI LERQYRETSS EVYKQKLEKY LVDQTCEVCH GLRLKPESLA VRMGPYRITD
LTSVSVRECL NKIEALIGSD NREPLLTARQ MQIGDLVLRE IQARLRFLLD VGLDYLTLDR
PAMTLSGGEA QRIRLATQIG AGLTGVLYVL DEPSIGLHQR DNDRLLNTLI RLRDLGNTLI
VVEHDEDTIR AADHLVDIGP GAGVHGGRIV AQGNLNQLLT AEESLTGAYL SGRRRIATPD
SRRAGNGRRL LLQGANRNNL QNLDVEIPLG KLVCITGVSG SGKSTLINEL LHPALEHSLG
LKVPFPKGLA VLKGTKAIDK VIVIDQSPIG RTPRSNPATY TGAFDPIRQV FAATTEAKAR
GYQPGQFSFN VKGGRCEACG GQGVNVIEMN FLPDVYVQCD VCKGARYNRE TLQVKYKGKT
IADVLNMTVE EAAQFFENIP QASSRLQTLV DVGLGYVRLG QPAPTLSGGE AQRVKLATEL
ARRATGKTLY LIDEPTTGLS FYDVHKLLDV MQRLVDKGNS IVVIEHNLDV IRCADWIIDL
GPEGGDRGGE IVVCGEPEIV ATHNQSHTGR YLAQVLAQHP PGQ
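
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined, translated, and checked for GC content. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in which start codons it allows). It is not the method used to produce this record.

```python
# Standard codon-to-amino-acid assignments in TCAG order
# (the same assignments are used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocks: str) -> str:
    """Join space- or newline-separated sequence blocks into one string."""
    return "".join(blocks.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = clean(
    "ATGCCCAAGC GCGCTTCTGT TGAGCCTTCA TCTGATCTGG CTCATCGTAA TGGCGATCGT"
)
last_line = clean("CCCGGTCAGT AG")

print(translate(first_line))  # MPKRASVEPSSDLAHRNGDR
print(translate(last_line))   # PGQ*
```

The two translated fragments match the first twenty residues and the last three residues of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_content` would give a value to compare against the GC content field.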