Gene Information |
Locus tag | PCC8801_3843 |
Symbol | uvrA |
ID | 7102133 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 4021497 |
End bp | 4024370 |
Gene Length | 2874 bp |
Protein Length | 957 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643476848 |
Product | excinuclease ABC subunit A |
Protein accession | YP_002373949 |
Protein GI | 218248578 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
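The coordinate and length fields in this table are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4021497
end_bp = 4024370

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 2874 0 957
```

The result agrees with the listed Gene Length (2874 bp) and Protein Length (957 aa).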
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGATC CTAATACTAT CCGCATTCGT GGCGCAAGAC AGCATAATTT AAAGAATATT GACCTGGATC TCCCCCGCGA TCGCCTCATC GTTTTTACTG GGGTTTCTGG TTCGGGTAAG TCGTCTTTGG CCTTTGATAC CATTTTTGCC GAAGGACAAA GACGCTATGT TGAGTCCCTT AGTGCCTATG CGCGGCAGTT TTTGGGACAA TTAGATAAAC CCGATGTAGA CGCGATCGAG GGGTTAAGTC CGGCTATTTC CATCGATCAA AAGTCCACCT CCCATAACCC CCGTTCGACG GTGGGGACGG TGACGGAAAT TTATGACTAT TTGCGGTTAT TGTTTGGACG GGCCGGAGAA CCCCACTGTC CGATCTGCGA TCGCAGTATC AGTCCCCAAA ATATTGATCA GATGTGCGAT CGCGTGATGG AACTGCCTGA TCGCACTAAA TTTCAGATTC TTTCTCCGGT GGTTCGCGGC AAAAAGGGAA CCCATAAACA ACTGTTATCA AGTTTAGCAT CTCAAGGATT TGTTCGGGTC AGAATTAATG GAGAAGTTCG AGAACTTTCT GATGCAATTG ACTTAAATAA AAATCATCAC CATAACATTG AAATTGTGAT TGATCGCCTC ATTAAAAAGC CAGGAATTGA AGAAAGATTA GTCGATTCTT TAACTACCTG TTTAAAGCAA TCAGAGGGGT TGGCAGTTAT TGATATTTTA GACGATGAAG ATAAGGATAA TCAAGAGGAA AAAGTCCCTT CAGAAATTAC CTTTTCAGAA AACTTTGCTT GTCCTGAACA TGGTGCAGTG ATAGAGGAAT TGTCCCCTCG TTTGTTTTCC TTTAATTCTC CCTATGGTGC TTGCCCTCAT TGTCATGGAT TAGGCAGTTT ACGGCAATTT TCGGCTGACT TAGTGATTCC TGATCCCAGT GCTTCTTTAT ATGCGGCGAT CGCCCCTTGG TCAGATAAAG ACAATTCCTA TTATCTTTCC TTACTCTATA GTGTCGGTCA AACTTGTGGG TTTGATATTC AAACTCCCTG GAATAAATTA ACCAAAGAAC AACAAAATAT CCTCCTTTAT GGTCAAGAAG AACCCATCTG GTTTGAGGAT GATTCTCGCT CAAAAAATAG TGAAGGATAC TATCGAAAAT TTGGTGGTAT TCTGGCAATG TTAGAACGCA GTTATCAAGA AACTAGCTCA GAAATTATTA AACAGAAATT AGAAAAATAT ATTGTTGATC GAACCTGTGA GGTTTGCCAA GGAAAACGAC TGAAACCCGA AGCTTTATCA GTTCGCTTAG GACAATATAA AATTGATCAA TTGACCAGTG TTTCTATTGA TAAATGCTTA GAAAGAGTCA ATCAATTGGA ATTAACCCCT AGACAAGCTT TAATTGGAGA ATTAGCCCTC AAAGAAATCA AAAACCGTCT ACAATTTCTC CTAGACGTTG GGTTAGATTA TTTAACCCTA GATAGAGGAA CCATGACCCT ATCAGGAGGA GAAGCACAAC GCATCCGTTT AGCCACACAA ATTGGCTCAG GACTCACGGG GGTATTATAT GTTTTAGATG AGCCGAGTAT TGGATTACAT CAACGAGATA ATCAACGCTT ATTAAATACC CTGAGAAAAC TCAGAGATCT CGGCAATACT TTAATAGTCG TAGAACACGA TCAAGAAACC ATAGAATGCG CCGATCATTT AGTTGATATT GGTCCCTTAG CAGGAGTTCA CGGAGGTAAG ATTGTTTGTC AAGGAAATTT AGAGACGCTA CTCAGCGATC AAACCTCTCT AACTGGGGCT 
TATTTATCGG GCAGAAAAGT TATAGAAACT CCTGAAAAAC GTCGCAAAGG AAATGGGTCT TCATTACAGC TAAAAAATTG TTGTCAAAAT AATCTTAAAA ACATCAATGT TGAGATTCCA TTAGGTAAAT TAGTTTGTAT TACTGGTCTT TCTGGTTCAG GAAAGTCTAC CCTAGTTAAT GAGTTATTAT ATCCTGCTTT ACAGCATCAT CTAACCCGTC AAGTCCCTTT TCCTAAAAAT TTAGAGCAAA TAAAAGGACT GAAAGCAGTT GATAAAGTGA TTGTTATTGA TCAGTCTCCC ATTGGCAGAA CTCCTCGGTC TAATCCCGCT ACCTACACAG GAGTTTTTGA TACAATTAGA GAACTATTTT CTCAAACTAT CGAAGCAAAA GCAAGAGGAT ATAAACAAGG TCAATTTTCC TTTAATGTAA AAGGGGGAAG ATGTGAAGTT TGTAATGGTC AGGGAGTGAA TATCATTGAA ATGAATTTTC TCCCAGATGT TTATGTACAA TGTGACGTTT GTAAAGGCGC AAGATACAAC CGAGAAACCC TGCAAGTGAA GTATAAAGAT TATTCTATTG CTGATGTTTT GAACATGACT GTTGAGGAAG CATTAGACGT ATTTCAGAAT ATTCCTAAAG CCGTTAAACG GTTACAAACC TTAGTAGATG TTGGTCTAGG TTATATCAAA TTAGGACAGT CTGCCCCCAC CTTATCAGGA GGAGAAGCAC AACGACTGAA ATTAGCCTCA GAATTGTCAA AAAGAGCAAC AGGAAAAACC CTTTATTTAA TTGATGAACC AACCACCGGA CTCTCTTTTT ATGATGTCCA TCACTTATTA AATGTTCTAC AAAGATTAGT CGATAAAGGC AATTCTATTT TAGTGATTGA ACACAATTTA GATGTCATTC GTTGTGCAGA TTGGATCATT GATTTAGGAC CTGAAGGAGG AGATAAAGGG GGAGAAATTA TCGCGTTAGG AACTCCTGAA GAAGTGGCTA ATAATTCTAA TTCTTATACA GGAAAATATT TAAAACAAGC CTTACAACAA CATCCAACAG CCAAGCAAAT TTAG
|
Protein sequence | MSDPNTIRIR GARQHNLKNI DLDLPRDRLI VFTGVSGSGK SSLAFDTIFA EGQRRYVESL SAYARQFLGQ LDKPDVDAIE GLSPAISIDQ KSTSHNPRST VGTVTEIYDY LRLLFGRAGE PHCPICDRSI SPQNIDQMCD RVMELPDRTK FQILSPVVRG KKGTHKQLLS SLASQGFVRV RINGEVRELS DAIDLNKNHH HNIEIVIDRL IKKPGIEERL VDSLTTCLKQ SEGLAVIDIL DDEDKDNQEE KVPSEITFSE NFACPEHGAV IEELSPRLFS FNSPYGACPH CHGLGSLRQF SADLVIPDPS ASLYAAIAPW SDKDNSYYLS LLYSVGQTCG FDIQTPWNKL TKEQQNILLY GQEEPIWFED DSRSKNSEGY YRKFGGILAM LERSYQETSS EIIKQKLEKY IVDRTCEVCQ GKRLKPEALS VRLGQYKIDQ LTSVSIDKCL ERVNQLELTP RQALIGELAL KEIKNRLQFL LDVGLDYLTL DRGTMTLSGG EAQRIRLATQ IGSGLTGVLY VLDEPSIGLH QRDNQRLLNT LRKLRDLGNT LIVVEHDQET IECADHLVDI GPLAGVHGGK IVCQGNLETL LSDQTSLTGA YLSGRKVIET PEKRRKGNGS SLQLKNCCQN NLKNINVEIP LGKLVCITGL SGSGKSTLVN ELLYPALQHH LTRQVPFPKN LEQIKGLKAV DKVIVIDQSP IGRTPRSNPA TYTGVFDTIR ELFSQTIEAK ARGYKQGQFS FNVKGGRCEV CNGQGVNIIE MNFLPDVYVQ CDVCKGARYN RETLQVKYKD YSIADVLNMT VEEALDVFQN IPKAVKRLQT LVDVGLGYIK LGQSAPTLSG GEAQRLKLAS ELSKRATGKT LYLIDEPTTG LSFYDVHHLL NVLQRLVDKG NSILVIEHNL DVIRCADWII DLGPEGGDKG GEIIALGTPE EVANNSNSYT GKYLKQALQQ HPTAKQI
|
| |
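The gene and protein sequences above are printed in blocks of 10 residues, so the spacing has to be removed before the data can be used. The following is a minimal sketch of how that might be done, together with a GC fraction calculation and a codon-by-codon translation. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the tables differ in which codons are allowed as starts, which does not matter here because this gene begins with ATG). The helper names `clean_sequence`, `gc_fraction` and `translate` are hypothetical and do not come from the source database.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base
# varies slowest). Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group residues into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
fragment = clean_sequence("ATGTCTGATC CTAATACTAT CCGCATTCGT")
print(translate(fragment))  # MSDPNTIRIR
```

The translated fragment matches the first block of the listed protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` should give a value that can be compared against the GC content field. This sketch raises a `KeyError` on ambiguity codes such as N, which the sequence shown here does not contain.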