Gene PCC8801_3843 details

Gene Information

Locus tag: PCC8801_3843
Symbol: uvrA
ID: 7102133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4021497
End bp: 4024370
Gene Length: 2874 bp
Protein Length: 957 aa
Translation table: 11
GC content: 39%
IMG OID: 643476848
Product: excinuclease ABC subunit A
Protein accession: YP_002373949
Protein GI: 218248578
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
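
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that consistency check, under two assumptions that the record itself does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4021497
END_BP = 4024370
GENE_LENGTH_BP = 2874
PROTEIN_LENGTH_AA = 957


def span_length(start, end):
    """Feature length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Codon count minus one, ASSUMING a single untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons hold for the values listed in this record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under those assumptions, the listed gene length (2874 bp) and protein length (957 aa) agree with the listed coordinates.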


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGATC CTAATACTAT CCGCATTCGT GGCGCAAGAC AGCATAATTT AAAGAATATT 
GACCTGGATC TCCCCCGCGA TCGCCTCATC GTTTTTACTG GGGTTTCTGG TTCGGGTAAG
TCGTCTTTGG CCTTTGATAC CATTTTTGCC GAAGGACAAA GACGCTATGT TGAGTCCCTT
AGTGCCTATG CGCGGCAGTT TTTGGGACAA TTAGATAAAC CCGATGTAGA CGCGATCGAG
GGGTTAAGTC CGGCTATTTC CATCGATCAA AAGTCCACCT CCCATAACCC CCGTTCGACG
GTGGGGACGG TGACGGAAAT TTATGACTAT TTGCGGTTAT TGTTTGGACG GGCCGGAGAA
CCCCACTGTC CGATCTGCGA TCGCAGTATC AGTCCCCAAA ATATTGATCA GATGTGCGAT
CGCGTGATGG AACTGCCTGA TCGCACTAAA TTTCAGATTC TTTCTCCGGT GGTTCGCGGC
AAAAAGGGAA CCCATAAACA ACTGTTATCA AGTTTAGCAT CTCAAGGATT TGTTCGGGTC
AGAATTAATG GAGAAGTTCG AGAACTTTCT GATGCAATTG ACTTAAATAA AAATCATCAC
CATAACATTG AAATTGTGAT TGATCGCCTC ATTAAAAAGC CAGGAATTGA AGAAAGATTA
GTCGATTCTT TAACTACCTG TTTAAAGCAA TCAGAGGGGT TGGCAGTTAT TGATATTTTA
GACGATGAAG ATAAGGATAA TCAAGAGGAA AAAGTCCCTT CAGAAATTAC CTTTTCAGAA
AACTTTGCTT GTCCTGAACA TGGTGCAGTG ATAGAGGAAT TGTCCCCTCG TTTGTTTTCC
TTTAATTCTC CCTATGGTGC TTGCCCTCAT TGTCATGGAT TAGGCAGTTT ACGGCAATTT
TCGGCTGACT TAGTGATTCC TGATCCCAGT GCTTCTTTAT ATGCGGCGAT CGCCCCTTGG
TCAGATAAAG ACAATTCCTA TTATCTTTCC TTACTCTATA GTGTCGGTCA AACTTGTGGG
TTTGATATTC AAACTCCCTG GAATAAATTA ACCAAAGAAC AACAAAATAT CCTCCTTTAT
GGTCAAGAAG AACCCATCTG GTTTGAGGAT GATTCTCGCT CAAAAAATAG TGAAGGATAC
TATCGAAAAT TTGGTGGTAT TCTGGCAATG TTAGAACGCA GTTATCAAGA AACTAGCTCA
GAAATTATTA AACAGAAATT AGAAAAATAT ATTGTTGATC GAACCTGTGA GGTTTGCCAA
GGAAAACGAC TGAAACCCGA AGCTTTATCA GTTCGCTTAG GACAATATAA AATTGATCAA
TTGACCAGTG TTTCTATTGA TAAATGCTTA GAAAGAGTCA ATCAATTGGA ATTAACCCCT
AGACAAGCTT TAATTGGAGA ATTAGCCCTC AAAGAAATCA AAAACCGTCT ACAATTTCTC
CTAGACGTTG GGTTAGATTA TTTAACCCTA GATAGAGGAA CCATGACCCT ATCAGGAGGA
GAAGCACAAC GCATCCGTTT AGCCACACAA ATTGGCTCAG GACTCACGGG GGTATTATAT
GTTTTAGATG AGCCGAGTAT TGGATTACAT CAACGAGATA ATCAACGCTT ATTAAATACC
CTGAGAAAAC TCAGAGATCT CGGCAATACT TTAATAGTCG TAGAACACGA TCAAGAAACC
ATAGAATGCG CCGATCATTT AGTTGATATT GGTCCCTTAG CAGGAGTTCA CGGAGGTAAG
ATTGTTTGTC AAGGAAATTT AGAGACGCTA CTCAGCGATC AAACCTCTCT AACTGGGGCT
TATTTATCGG GCAGAAAAGT TATAGAAACT CCTGAAAAAC GTCGCAAAGG AAATGGGTCT
TCATTACAGC TAAAAAATTG TTGTCAAAAT AATCTTAAAA ACATCAATGT TGAGATTCCA
TTAGGTAAAT TAGTTTGTAT TACTGGTCTT TCTGGTTCAG GAAAGTCTAC CCTAGTTAAT
GAGTTATTAT ATCCTGCTTT ACAGCATCAT CTAACCCGTC AAGTCCCTTT TCCTAAAAAT
TTAGAGCAAA TAAAAGGACT GAAAGCAGTT GATAAAGTGA TTGTTATTGA TCAGTCTCCC
ATTGGCAGAA CTCCTCGGTC TAATCCCGCT ACCTACACAG GAGTTTTTGA TACAATTAGA
GAACTATTTT CTCAAACTAT CGAAGCAAAA GCAAGAGGAT ATAAACAAGG TCAATTTTCC
TTTAATGTAA AAGGGGGAAG ATGTGAAGTT TGTAATGGTC AGGGAGTGAA TATCATTGAA
ATGAATTTTC TCCCAGATGT TTATGTACAA TGTGACGTTT GTAAAGGCGC AAGATACAAC
CGAGAAACCC TGCAAGTGAA GTATAAAGAT TATTCTATTG CTGATGTTTT GAACATGACT
GTTGAGGAAG CATTAGACGT ATTTCAGAAT ATTCCTAAAG CCGTTAAACG GTTACAAACC
TTAGTAGATG TTGGTCTAGG TTATATCAAA TTAGGACAGT CTGCCCCCAC CTTATCAGGA
GGAGAAGCAC AACGACTGAA ATTAGCCTCA GAATTGTCAA AAAGAGCAAC AGGAAAAACC
CTTTATTTAA TTGATGAACC AACCACCGGA CTCTCTTTTT ATGATGTCCA TCACTTATTA
AATGTTCTAC AAAGATTAGT CGATAAAGGC AATTCTATTT TAGTGATTGA ACACAATTTA
GATGTCATTC GTTGTGCAGA TTGGATCATT GATTTAGGAC CTGAAGGAGG AGATAAAGGG
GGAGAAATTA TCGCGTTAGG AACTCCTGAA GAAGTGGCTA ATAATTCTAA TTCTTATACA
GGAAAATATT TAAAACAAGC CTTACAACAA CATCCAACAG CCAAGCAAAT TTAG
 
Protein sequence
MSDPNTIRIR GARQHNLKNI DLDLPRDRLI VFTGVSGSGK SSLAFDTIFA EGQRRYVESL 
SAYARQFLGQ LDKPDVDAIE GLSPAISIDQ KSTSHNPRST VGTVTEIYDY LRLLFGRAGE
PHCPICDRSI SPQNIDQMCD RVMELPDRTK FQILSPVVRG KKGTHKQLLS SLASQGFVRV
RINGEVRELS DAIDLNKNHH HNIEIVIDRL IKKPGIEERL VDSLTTCLKQ SEGLAVIDIL
DDEDKDNQEE KVPSEITFSE NFACPEHGAV IEELSPRLFS FNSPYGACPH CHGLGSLRQF
SADLVIPDPS ASLYAAIAPW SDKDNSYYLS LLYSVGQTCG FDIQTPWNKL TKEQQNILLY
GQEEPIWFED DSRSKNSEGY YRKFGGILAM LERSYQETSS EIIKQKLEKY IVDRTCEVCQ
GKRLKPEALS VRLGQYKIDQ LTSVSIDKCL ERVNQLELTP RQALIGELAL KEIKNRLQFL
LDVGLDYLTL DRGTMTLSGG EAQRIRLATQ IGSGLTGVLY VLDEPSIGLH QRDNQRLLNT
LRKLRDLGNT LIVVEHDQET IECADHLVDI GPLAGVHGGK IVCQGNLETL LSDQTSLTGA
YLSGRKVIET PEKRRKGNGS SLQLKNCCQN NLKNINVEIP LGKLVCITGL SGSGKSTLVN
ELLYPALQHH LTRQVPFPKN LEQIKGLKAV DKVIVIDQSP IGRTPRSNPA TYTGVFDTIR
ELFSQTIEAK ARGYKQGQFS FNVKGGRCEV CNGQGVNIIE MNFLPDVYVQ CDVCKGARYN
RETLQVKYKD YSIADVLNMT VEEALDVFQN IPKAVKRLQT LVDVGLGYIK LGQSAPTLSG
GEAQRLKLAS ELSKRATGKT LYLIDEPTTG LSFYDVHHLL NVLQRLVDKG NSILVIEHNL
DVIRCADWII DLGPEGGDKG GEIIALGTPE EVANNSNSYT GKYLKQALQQ HPTAKQI
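
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. It is an illustration only: `clean_sequence` and `gc_content` are hypothetical helper names, and the sample string is just the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text):
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = (
    "ATGTCTGATC CTAATACTAT CCGCATTCGT GGCGCAAGAC AGCATAATTT AAAGAATATT"
)

fragment = clean_sequence(first_line)
print(len(fragment))                  # number of bases in the joined fragment
print(fragment.startswith("ATG"))     # True if it begins with a start codon
print(round(gc_content(fragment), 2))
```

To check the record's GC content value, the same two functions would be applied to the full gene sequence rather than to this fragment.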