Gene Cyan8802_3893 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3893 
SymboluvrA 
ID8393243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4002677 
End bp4005550 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content39% 
IMG OID644981818 
Productexcinuclease ABC subunit A 
Protein accessionYP_003139532 
Protein GI257061644 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0968231 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATC CTAATACTAT CCGCATTCGT GGCGCAAGAC AGCATAATTT AAAGAATATT 
GACCTGGATC TCCCCCGCGA TCGCCTCATC GTTTTTACTG GGGTTTCTGG TTCGGGTAAG
TCGTCTTTGG CCTTTGATAC TATTTTTGCC GAAGGACAAA GACGCTATGT TGAGTCCCTT
AGTGCCTATG CGCGGCAGTT TTTGGGACAA TTAGATAAAC CCGATGTAGA CGCGATCGAG
GGGTTAAGTC CGGCTATTTC CATCGATCAA AAGTCCACCT CCCATAACCC CCGTTCGACG
GTGGGGACGG TGACGGAAAT TTATGACTAT TTGCGGTTAT TGTTTGGACG GGCCGGAGAA
CCCCACTGTC CGATCTGCGA TCGCAGTATC AGTCCCCAAA ATATTGATCA GATGTGCGAT
CGCGTGATGG AACTGCCTGA TCGCACTAAA TTTCAGATTC TTTCTCCGGT GGTTCGCGGC
AAAAAGGGAA CCCATAAACA ACTGTTATCA AGTTTAGCAT CTCAAGGATT TGTTCGGGTC
AGAATTAATG GAGAAGTTCG AGAACTTTCT GATGCAATTG ACTTAAATAA AAATCATCAC
CATAACATTG AAATTGTGAT TGATCGCCTC ATTAAAAAGC CAGGAATTGA AGAAAGATTA
GTCGATTCTT TAACTACCTG TTTAAAGCAA TCAGAGGGGT TGGCAGTTAT TGATATTTTA
GACGATGAAG ATAAGGATAA TCAAGAGGAA AAAGTCCCTT CAGAAATTAC CTTTTCAGAA
AACTTTGCTT GTCCTGAACA TGGTGCAGTG ATAGAGGAAT TGTCCCCTCG TTTGTTTTCC
TTTAATTCTC CCTATGGTGC TTGCCCTCAT TGTCATGGAT TAGGCAGTTT ACGGCAATTT
TCGGCTGACT TAGTGATTCC TGATCCCAGT GCTTCTTTAT ATGCGGCGAT CGCCCCTTGG
TCAGATAAAG ACAATTCCTA TTATCTTTCC TTACTCTATA GTGTCGGTCA AACTTGTGGG
TTTGATATTC AAACTCCCTG GAATAAATTA ACCAAAGAAC AACAAAATAT CCTCCTTTAT
GGTCAAGAAG AACCCATCTG GTTTGAGGAT GATTCTCGCT CAAAAAATAG TGAAGGATAC
TATCGAAAAT TTGGTGGTAT TCTGGCAATG TTAGAACGCA GTTATCAAGA AACTAGCTCA
GAAATTATTA AACAGAAATT AGAAAAATAT ATTGTTGATC GAACCTGTGA GGTTTGCCAA
GGAAAACGAC TGAAACCCGA AGCTTTATCA GTTCGCTTAG GACAATATAA AATTGATCAA
TTGACCAGTG TTTCTATTGA TAAATGCTTA GAAAGAGTCA ATCAATTGGA ATTAACCCCT
AGACAAGCTT TAATTGGAGA ATTAGCCCTC AAAGAAATCA AAAACCGTCT ACAATTTCTC
CTAGACGTTG GGTTAGATTA TTTAACCCTA GATAGAGGAA CCATGACCCT ATCAGGAGGA
GAAGCACAAC GCATCCGTTT AGCCACACAA ATTGGCTCAG GACTCACGGG GGTATTATAT
GTTTTAGATG AGCCGAGTAT TGGATTACAT CAACGAGATA ATCAACGCTT ATTAAATACC
CTGAGAAAAC TCAGAGATCT CGGCAATACT TTAATAGTCG TAGAACACGA TCAAGAAACC
ATAGAATGCG CCGATCATTT AGTTGATATT GGTCCCTTAG CAGGAGTTCA CGGAGGTAAG
ATTGTTTGTC AAGGAAATTT AGAGACGCTA CTCAGCGATC AAACCTCTTT AACTGGGGCT
TATTTATCGG GCAGAAAAGT TATAGAAACT CCTGAAAAAC GTCGCAAAGG AAATGGGTCT
TCATTACAGC TAAAAAATTG TTGTCAAAAT AATCTTAAAA ACATCAATGT TGAGATTCCA
TTAGGTAAAT TAGTTTGTAT TACTGGTCTT TCTGGTTCAG GAAAGTCTAC CCTAGTTAAT
GAGTTATTAT ATCCTGCTTT ACAGCATCAT CTAACCCGTC AAGTTCCTTT TCCTAAAAAT
TTAGAGCAAA TCAAAGGACT GAAAGCAGTT GATAAAGTGA TTGTTATTGA TCAGTCTCCT
ATTGGCAGAA CTCCTCGGTC TAATCCCGCT ACCTACACAG GAGTTTTTGA TACGATTAGA
GAACTATTTT CTCAAACTAT CGAAGCAAAA GCAAGAGGAT ATAAACAAGG TCAATTTTCC
TTTAATGTAA AAGGGGGACG ATGTGAAGTT TGTAATGGTC AGGGAGTCAA TATCATTGAA
ATGAATTTTC TCCCAGATGT TTATGTACAA TGTGACGTTT GTAAAGGCGC AAGATACAAC
CGAGAAACCC TGCAAGTGAA GTATAAAGAT TATTCTATTG CTGATGTTTT GAACATGACT
GTTGAGGAAG CATTAGACGT ATTTCAGAAT ATTCCTAAAG CCGTTAAACG GTTACAAACC
TTAGTAGATG TTGGTCTAGG TTATATCAAA TTAGGACAGT CTGCCCCCAC CTTATCAGGA
GGAGAAGCAC AACGACTGAA ATTAGCCTCA GAATTGTCAA AAAGAGCAAC AGGAAAAACC
CTTTATTTAA TTGATGAACC AACCACTGGA CTCTCTTTTT ATGATGTCCA TCACTTATTA
AATGTTCTAC AAAGATTAGT CGATAAAGGC AATTCTATTT TAGTGATTGA ACACAATTTA
GATGTCATTC GTTGTGCAGA TTGGATCATT GATTTAGGAC CTGAAGGAGG AGATAAAGGG
GGAGAAATTA TCGCGTTAGG AACTCCTGAA GAAGTGGCTA ATAATCCTAA TTCTTATACA
GGAAAATATT TAAAACAAGC CTTACAACAA CATCCAACAG CCAAGCAAAC TTAG
 
Protein sequence
MSDPNTIRIR GARQHNLKNI DLDLPRDRLI VFTGVSGSGK SSLAFDTIFA EGQRRYVESL 
SAYARQFLGQ LDKPDVDAIE GLSPAISIDQ KSTSHNPRST VGTVTEIYDY LRLLFGRAGE
PHCPICDRSI SPQNIDQMCD RVMELPDRTK FQILSPVVRG KKGTHKQLLS SLASQGFVRV
RINGEVRELS DAIDLNKNHH HNIEIVIDRL IKKPGIEERL VDSLTTCLKQ SEGLAVIDIL
DDEDKDNQEE KVPSEITFSE NFACPEHGAV IEELSPRLFS FNSPYGACPH CHGLGSLRQF
SADLVIPDPS ASLYAAIAPW SDKDNSYYLS LLYSVGQTCG FDIQTPWNKL TKEQQNILLY
GQEEPIWFED DSRSKNSEGY YRKFGGILAM LERSYQETSS EIIKQKLEKY IVDRTCEVCQ
GKRLKPEALS VRLGQYKIDQ LTSVSIDKCL ERVNQLELTP RQALIGELAL KEIKNRLQFL
LDVGLDYLTL DRGTMTLSGG EAQRIRLATQ IGSGLTGVLY VLDEPSIGLH QRDNQRLLNT
LRKLRDLGNT LIVVEHDQET IECADHLVDI GPLAGVHGGK IVCQGNLETL LSDQTSLTGA
YLSGRKVIET PEKRRKGNGS SLQLKNCCQN NLKNINVEIP LGKLVCITGL SGSGKSTLVN
ELLYPALQHH LTRQVPFPKN LEQIKGLKAV DKVIVIDQSP IGRTPRSNPA TYTGVFDTIR
ELFSQTIEAK ARGYKQGQFS FNVKGGRCEV CNGQGVNIIE MNFLPDVYVQ CDVCKGARYN
RETLQVKYKD YSIADVLNMT VEEALDVFQN IPKAVKRLQT LVDVGLGYIK LGQSAPTLSG
GEAQRLKLAS ELSKRATGKT LYLIDEPTTG LSFYDVHHLL NVLQRLVDKG NSILVIEHNL
DVIRCADWII DLGPEGGDKG GEIIALGTPE EVANNPNSYT GKYLKQALQQ HPTAKQT