Gene Ava_1709 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_1709 
Symbol 
ID3682189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp2142223 
End bp2145249 
Gene Length3027 bp 
Protein Length1008 aa 
Translation table11 
GC content45% 
IMG OID637717049 
Productexonuclease SbcC 
Protein accessionYP_322226 
Protein GI75907930 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCAG TACAACTTGT CCTAAAAAAT TTTCTGAGTT ACCGTGATGC AACTTTAGAT 
TTTCGTGGTT TGCATACGGC TTGTATTTGT GGTGCCAATG GAGCAGGTAA ATCGTCTCTC
CTGGAAGCCA TCACATGGGC GCTATGGGGT GAAAGTCGTG CGGCGGTGGA AGATGATGTG
ATCAATTCTG GTGAAAAAGA AGTTAGGGTT GATTTTACTT TTCAAAATAA TCAGCAAAAA
TATCGCGTAA TTCGCAGTAG AATTCGGGGT GCATCGGGGA TTTTGGAATT TCAAATCGAA
ACACCCTCTG GCTTCCGGGC AATTACTGGT AAAGGTGTCC GCGCCACCCA GGATTTGATT
TTAGAACACA TCAAGCTTGA TTATGAGACT TTTATTAACT CTGCTTACCT ACGTCAAGGA
AGAGCAGATG AATTTATGCT CAAGCGTCCC AGTGAACGCA AGGAAATTTT GGCGGAGTTG
TTGAAACTAA ACCAATATGA TGAGTTGGAA GAACGAGCCA AAGATTCATC GCGTCAATTT
AAAGTGAGGG CGGAAGAATT AGAACGTTCT TTGGAGTCAA TTAAAACTCA GTTACAACAA
CGGGAAAACA CCAAAGCCCA ACGGGCAGAA TTAGAAGCGG AAATTAACCA ACTGCAACAG
GTACAAGCCT TTGAGACGAT TCAATTGCAG AGTTTGCAGG TGATTCAACA CCAGAGGCAA
AATAGCGAAC AGCAGTTAAA TTTTGTGAGG CAACAATACC AAAATCTTAC ACAAGATAGC
GATCGCCTCC AACAAGAACA GTCAGCAGTC AAAACACAGT TAGCTGAGTT AGAAGCTATC
TTGCACCGGG AAGCAGAGAT TCAAAACGGC TACAGCCATT ACCAAAGTCT GCAATCCCAA
GAAGAAGTAT TTGCTGTCAA GTTTGACGAA CATACCCGCG CCACAGCCCT ACGCCAACAA
AAGCAGCAAC AGTTAACTAA ACAAATTCAC GAACTGGAAA GACAACTCCA ACAAGTCCAA
GGACAATTGG AAGCCTTAAA GCAACAAGAA CAAGAAATTC AGCACACCTT GAGCAAATCC
GGGGAAATAG AAGCCGCCTT AGCCCAACTC GCCGCCGCCC GTCGCCATTT AGCGCACCTA
GATGAACTGC AAATGCAGGT TTCCCCTTTG TTGCAACAGC GCCAACATCT ACAAAGCCAT
TTAGACCGGG TTCATGCGGG GTTAGTCGCC AGGCTCGAAC AACTACAAGC CACAGAGAAT
CAATTGCAAC GAGCCTCCCA ACGCCAACCC CAACTGCAAC AAGCGGTCAT GGAAGTGGCG
ATGCAAATTG ATCAAATGGA GAAAGACCGG GTTTATTTGC AACGGGTGCA GGAAAAAGGC
CACGAAAGAC GGAATTTTAT AGAACGTCTG CAAGCCCAAC AACGGGAATA TGAAAGATTA
TTGGGAGAAA TGGAGCAGAA ATTGCAAATG CTCCGCACCC CCGATGCTAT CTGTCCTTTG
TGCGAACGTC CTTTAGATGA ACATCATTGG AATCGGGTAG TAGATAAAAC CAAAGATGAG
TATAAGGAAA CCGAGGGGCA ATTGTGGGTG TTTCGTGAGC AGATGGCAGT TTCCGATAGA
GAAATTCAGG TATTAAGGCA GGAATATCGG GAAATATCCC AGAAATTAAA TGCTTACGAT
GCGTTACGAG AACAACGAGG ACAATTAGCT GCCCAACTGC AATCTACCAG CGATGCTGAA
CAACAATTAC AACAATTGGC AGCCGAAAAA CAACACTTAG AGCGATCGCT GCAAGTTGGT
GATTTTGCCC TTGATCAACA AGCTGAATTG CGGCAATTAG AACAATATCT GCAACAATTA
AATTACAATG AGCAAGACCA TGCCCTGGCT CGTAGCGAAG TGGAACGGTG GCGATGGGCA
GAAATTAAGC AAGGGCAAAT CAAGGATGCT AGCAAACGCC TGGCGACATT AGCAGCCCGT
AAACCAGAAT TACAAGCTCA AATTACCCAA CTACAAACCA GAATCCAGCA GGAGCAAACT
GATTCGGAAG ATGCTCAACA AATCGCAGCC CTTGAGGAGC AAATCGCAGA ACTTGGTTAT
AGTTCTGAGC AGCATAACAA CCTACGTCTC GCGGTGCGCC AGTCCCAAGC TTGGCAGTTG
CGTTACCAGC AGCTGTTATC AGCCCAGCAA CAGTATCCCC AACTCCAGGG CAGATTGCAA
GATTTGGAAG TCTCTAAACT TGCCAGATTG CAGGAAAGAC AACAACTCGC CACCCAAATC
GATAGCCTTG TGGAGCAGTT AGCCCAAGCC GCTAACCCCA GGGAGCAAAT TCAAGCTTTA
GAACAACAGT TAGCCACAAG ACGGCGACAA CTAGACGAAC AAATCACCCA GTTAGGACGT
TTAGAACAGT TAGCCCATCA GTTGGAAACA CTGCAAACTC AATATGAGGA GCAGCAGCAA
CAATACCAAA CTTGCAAGCA GCAATACCGG GTATATCAGG AATTAGCCCA GGCGTTTGGT
AAAAATGGTA TCCAAGCTTT GATGATTGAG AACGTGTTGC CCCAACTAGA AGCCGAGACA
AATCAACTTT TATCCAGGCT GAGTGCGAAT CAACTGCACG TACAATTTAT TACTCAAAGA
GCAGGAAAGG GCAGTAAATC TACCAAGAAA AATGCCAAAC TCATAGATAC CTTAGATATT
CTCATTGCCG ATGCCAGAGG TACAAGAGCT TATGAAACCT ACTCCGGTGG GGAAGCTTTT
AGAATTAACT TTGCGATTCG TTTAGCCTTA GCGAAACTAT TAGCCCAAAG AGCCGGAGCC
GCATTGCAAC TATTGATTGT AGACGAAGGT TTTGGTACAC AAGATGCGGA AGGATGCGAT
CGCCTCATTG CTGCCATTAA TGCGATCGCT AACGATTTCG CCTGCATCCT CACTGTTACC
CATATGCCCC ACCTTAAGGA GGCGTTTCAA GCCAGAATTG AGGTAGATAA AACGCAACAG
GGGTCGAGGA TTCGGTTGTT AACTTAA
 
Protein sequence
MIPVQLVLKN FLSYRDATLD FRGLHTACIC GANGAGKSSL LEAITWALWG ESRAAVEDDV 
INSGEKEVRV DFTFQNNQQK YRVIRSRIRG ASGILEFQIE TPSGFRAITG KGVRATQDLI
LEHIKLDYET FINSAYLRQG RADEFMLKRP SERKEILAEL LKLNQYDELE ERAKDSSRQF
KVRAEELERS LESIKTQLQQ RENTKAQRAE LEAEINQLQQ VQAFETIQLQ SLQVIQHQRQ
NSEQQLNFVR QQYQNLTQDS DRLQQEQSAV KTQLAELEAI LHREAEIQNG YSHYQSLQSQ
EEVFAVKFDE HTRATALRQQ KQQQLTKQIH ELERQLQQVQ GQLEALKQQE QEIQHTLSKS
GEIEAALAQL AAARRHLAHL DELQMQVSPL LQQRQHLQSH LDRVHAGLVA RLEQLQATEN
QLQRASQRQP QLQQAVMEVA MQIDQMEKDR VYLQRVQEKG HERRNFIERL QAQQREYERL
LGEMEQKLQM LRTPDAICPL CERPLDEHHW NRVVDKTKDE YKETEGQLWV FREQMAVSDR
EIQVLRQEYR EISQKLNAYD ALREQRGQLA AQLQSTSDAE QQLQQLAAEK QHLERSLQVG
DFALDQQAEL RQLEQYLQQL NYNEQDHALA RSEVERWRWA EIKQGQIKDA SKRLATLAAR
KPELQAQITQ LQTRIQQEQT DSEDAQQIAA LEEQIAELGY SSEQHNNLRL AVRQSQAWQL
RYQQLLSAQQ QYPQLQGRLQ DLEVSKLARL QERQQLATQI DSLVEQLAQA ANPREQIQAL
EQQLATRRRQ LDEQITQLGR LEQLAHQLET LQTQYEEQQQ QYQTCKQQYR VYQELAQAFG
KNGIQALMIE NVLPQLEAET NQLLSRLSAN QLHVQFITQR AGKGSKSTKK NAKLIDTLDI
LIADARGTRA YETYSGGEAF RINFAIRLAL AKLLAQRAGA ALQLLIVDEG FGTQDAEGCD
RLIAAINAIA NDFACILTVT HMPHLKEAFQ ARIEVDKTQQ GSRIRLLT