Gene Ava_3591 details


Gene Information

Locus tag: Ava_3591
Symbol: uvrA
ID: 3679361
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 4472899
End bp: 4475796
Gene Length: 2898 bp
Protein Length: 965 aa
Translation table: 11
GC content: 42%
IMG OID: 637718942
Product: excinuclease ABC subunit A
Protein accession: YP_324092
Protein GI: 75909796
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
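
The coordinate and length fields above can be checked against each other with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4472899
end_bp = 4475796
gene_length_bp = 2898
protein_length_aa = 965

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not encode a residue.
codon_count = span_bp // 3
coding_codons = codon_count - 1

print(span_bp == gene_length_bp)           # coordinates agree with Gene Length
print(span_bp % 3 == 0)                    # span is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop agree with Protein Length
```

Under those assumptions all three checks print `True`: the span is 2898 bp, which is 966 codons, leaving 965 coding codons once the stop codon is set aside.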


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.240208
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0431372
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATA ATAAGCTAGC CGCATCCCTG AATGGACATC TTCCCTACGC CAATCACAAC 
AGCCAGAATA CCATTAGAAT TCGGGGTGCG CGACAGCATA ATCTGAAGAA TATTGACTTG
GAATTGCCAC GCGATCGCCT AATCGTGTTT ACTGGCGTAT CAGGTTCGGG TAAGTCTTCT
TTGGCGTTTG ATACCATCTT CGCCGAAGGT CAACGGCGCT ATGTGGAATC TCTCAGCGCC
TACGCCAGAC AATTTTTAGG ACAATTAGAT AAACCGGATG TGGAAGCCAT CGAAGGTTTA
AGCCCAGCGA TTTCCATTGA CCAAAAATCT ACATCTCACA ACCCCCGTTC TACTGTGGGG
ACGGTAACAG AAATTTACGA CTATTTGCGA CTGTTGTTTG GTCGCGCTGG TGAACCCCAT
TGTCCCATCT GCGATCGCTG TATTGCACCC CAAACCATTG ATGAGATGGT TGATAGGATT
ATGGAACTAC CAGACAGGAC TCGCTTCCAA ATTCTCGCGC CTGTTGTTAG GGGTAAGAAA
GGAACTCACC GCAAATTATT ATCTAGTTTG GCTTCCCAAG GCTTTGTTCG GGTGCGGGTT
GATGGCGAAG TCCGCGAACT TTCCGACTCG ATTGAATTAG ATAAAAATAT TACTCATACC
ATTGAGGTTG TCATTGACCG ACTGGTAAAA AAAGACGGTA TTCAAGAACG TTTAGTAGAT
TCTTTGTCTA CGTGTCTCAA ACAAGCTGGT GGTATTGCTA ACATATTAAT CAGTAATTCA
TCAACAACGG ACAACGGACA AGAGACAACT GACGATGAAG AATTAGTATT TTCGGAAAAC
TTTGCTTGTC CTGAACATGG CGCAGTTATG GAGGAGTTAT CACCGCGTTT GTTCTCCTTT
AACTCCCCCT ATGGTGCTTG TCCCAACTGT CACGGTCTTG GCACTTTACG CAGATTTTCA
CCAGAGTTAG TCGTACCTGA CCCTGAGGCT CCAGTATATG CAGCGATCGC TCCTTGGTCA
GAAAAAGAAA ATTCCTATTA CCTGGAACTA CTATATAGCT TGGGACAAAC TCATAATTTT
GAGTTGCAAA TAAATTGGCA TAAACTCACC CCAGAACAAC AGCAAATTAT TTTGTATGGG
GAGAAACAGG AAGGTAAAGA TAATCCTAAG ACACCAAGTT TTAAAGGTGT ATTGCCAATT
TTACAACGCC AATATGAAGG CGGTTCCGAA TTAATTAAGC AGAAATTAGA GCAGTATTTA
ATTGACCAAC CATGTGAAGT TTGTCACGGC AAACGGTTAA AACCAGAAGC CTTGGCGGTG
AAGTTGGGAC AATATAATAT CTTAGATTTA ACTGGAGTTT CGATTCGGGA TTGCCGAGAG
AGAACAGAAC AATTAAAATT AAGCGATCGG CAGATGCAAA TTGCTGATTT AGTCTTGCGA
GAAGTTAAAG CTAGATTACA ATTTTTGTTA GATGTCGGCT TAGATTACCT CACCCTAGAC
CGTGCCGCCA TGACCCTCTC TGGAGGGGAA GCCCAACGCA TTCGCCTCGC TACACAAATT
GGCTCTGGCT TAACAGGTGT TCTCTATGTT TTAGATGAAC CAAGCATTGG TTTACACCAA
AGAGATAACG CAAGATTACT CAAAACTTTA ACTAAATTAA GGGATTTAGG TAATACCTTA
ATTGTAGTCG AACACGACGA AGAAACAATC CGCGCTGCGG ACTATCTAGT TGATATTGGG
CCAGGTGCGG GTATCCACGG CGGTAATATT ATTTCTCAAG GCGATTTGCA AGCTTTATTA
ACAGCAGAAG AGTCTCTTAC TGGTGCATAT TTATCAGGAC GTAAGGTCAT TAATACACCA
GGAGAACGCC GTGAAGGAAA TGGTCGGAGT TTAACCATTA AAAATGCCCA TCGCAATAAT
TTAAGAAATA TTGATGTAGA AATTCCTTTA GGTAAATTAG TTGCTGTCAC GGGAGTATCT
GGTTCTGGTA AATCGACATT AATTAACGAA TTACTTTACC CATCACTACA ACATCATCTA
ACTAAAAAAG TTCCTTTACC TAAAGAGTTA GAAAAAATTC AGGGGTTGAG TGCAGTAGAT
AAAGCGATCG TCATCGACCA ATCACCCATC GGACGCACAC CACGTTCTAA CCCCGCAACT
TACACAGGAG TTTTTGACGT AATTCGGGAC GTATTTTCAC AAACAGTAGA AGCAAAAGCC
AGAGGTTACA AACCCGGACA ATTTTCTTTC AACGTTAAAG GTGGACGTTG TGAAGCCTGT
AGCGGACAGG GTGTAAATGT CATTGAGATG AACTTTCTTC CTGATGTTTA TGTGCAGTGT
GAAATTTGCA AAGGTGCAAG ATACAACCGC GAAACGTTGC AAGTGAAATA TAAAGATAAA
TCTATTTCCG ATGTTCTCAA CATGACTGTT GAGGAGAGTT TAGATTTCTT CCAAAACATC
CCCAAAGCTG CTACAAGATT GCAAACTTTA GTTGATGTTG GATTAGGCTA TGTTCAACTA
GGACAGCCTG CAACTACCTT ATCTGGTGGT GAAGCGCAAA GAGTCAAATT AGCCACAGAA
TTATCTCGCC GCGCTACAGG TAAGACCCTG TATTTAATCG ATGAACCCAC CACCGGCTTA
TCTTTTTATG ATGTCCACAA ATTACTAGAT GTGTTGCAAA GATTGGTCGA TAAAGGTAAT
TCGATTTTGG TAATTGAACA CAACTTAGAT GTAATTCGTT GTGCTGATTG GGTCATTGAT
TTAGGGCCAG AAGGTGGCGA CAAGGGGGGA GAAGTGATTG CTGTCGGGAC ACCAGAGGAA
GTTGCTAAAA ATACCAGTTC TTATACTGGG CAATATTTGC AGCAGGTATT GCAACAATAT
CCTGCATTAA AAGATTAA
 
Protein sequence
MSDNKLAASL NGHLPYANHN SQNTIRIRGA RQHNLKNIDL ELPRDRLIVF TGVSGSGKSS 
LAFDTIFAEG QRRYVESLSA YARQFLGQLD KPDVEAIEGL SPAISIDQKS TSHNPRSTVG
TVTEIYDYLR LLFGRAGEPH CPICDRCIAP QTIDEMVDRI MELPDRTRFQ ILAPVVRGKK
GTHRKLLSSL ASQGFVRVRV DGEVRELSDS IELDKNITHT IEVVIDRLVK KDGIQERLVD
SLSTCLKQAG GIANILISNS STTDNGQETT DDEELVFSEN FACPEHGAVM EELSPRLFSF
NSPYGACPNC HGLGTLRRFS PELVVPDPEA PVYAAIAPWS EKENSYYLEL LYSLGQTHNF
ELQINWHKLT PEQQQIILYG EKQEGKDNPK TPSFKGVLPI LQRQYEGGSE LIKQKLEQYL
IDQPCEVCHG KRLKPEALAV KLGQYNILDL TGVSIRDCRE RTEQLKLSDR QMQIADLVLR
EVKARLQFLL DVGLDYLTLD RAAMTLSGGE AQRIRLATQI GSGLTGVLYV LDEPSIGLHQ
RDNARLLKTL TKLRDLGNTL IVVEHDEETI RAADYLVDIG PGAGIHGGNI ISQGDLQALL
TAEESLTGAY LSGRKVINTP GERREGNGRS LTIKNAHRNN LRNIDVEIPL GKLVAVTGVS
GSGKSTLINE LLYPSLQHHL TKKVPLPKEL EKIQGLSAVD KAIVIDQSPI GRTPRSNPAT
YTGVFDVIRD VFSQTVEAKA RGYKPGQFSF NVKGGRCEAC SGQGVNVIEM NFLPDVYVQC
EICKGARYNR ETLQVKYKDK SISDVLNMTV EESLDFFQNI PKAATRLQTL VDVGLGYVQL
GQPATTLSGG EAQRVKLATE LSRRATGKTL YLIDEPTTGL SFYDVHKLLD VLQRLVDKGN
SILVIEHNLD VIRCADWVID LGPEGGDKGG EVIAVGTPEE VAKNTSSYTG QYLQQVLQQY
PALKD
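
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before they can be compared. The sketch below is one way to do that without external libraries: it strips the whitespace, computes GC content, and translates codons with the standard amino acid assignments (NCBI translation table 11 assigns the same amino acids to codons as table 1; the two differ in which start codons are permitted). The helper names `clean`, `gc_content`, and `translate` are illustrative, and the code assumes the input holds only unambiguous A, C, G, and T bases in frame.

```python
# Codon table built in TCAG order. Table 11 shares these amino acid
# assignments with the standard code (table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Assumes only A, C, G, T; any other character raises KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied as printed above.
first_block = "ATGTCAGATA ATAAGCTAGC CGCATCCCTG AATGGACATC TTCCCTACGC CAATCACAAC"
last_block = "CCTGCATTAA AAGATTAA"

print(translate(first_block))  # MSDNKLAASLNGHLPYANHN
print(translate(last_block))   # PALKD
```

The two printed strings match the first 20 and the last 5 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field in the record.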