Gene Ava_0581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAva_0581 
Symbol 
ID3678611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnabaena variabilis ATCC 29413 
KingdomBacteria 
Replicon accessionNC_007413 
Strand
Start bp729107 
End bp732040 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content39% 
IMG OID637715909 
ProductDNA polymerase I 
Protein accessionYP_321100 
Protein GI75906804 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAA ACTCTCCTTC TGTAACTACA ACTACTACAC GCCCCACGTT CATCCTTGTA 
GATGGACACT CCCTGGCTTT CCGTTCATAC TTTGCCTTTG CTAAAGGACG GGATGGTGGG
CTGCGTACTA AGACAGGTAT TCCTACCAGT GTTTGCTTTG GCTTTCTAAA GTCTTTACTG
GAAGTTATGG CTACACAACA ACCGCAAGCA ATGGCGATCG CTTTTGATTT GGGTTTACCA
ACTTTCCGCC ACGAAGCCGA CGATACCTAT AAAGCTGATC GCCCCGGAAC GCCAGAGGAT
TTTGTCCCTG ATTTAAAAAA CCTCCATGAA TTACTGGAAG GCTTCAACCT ACCAATTTAT
ACAGCCCCCG GTTATGAAGC TGATGATGTT TTAGGAACCT TGGCACAAAA AGCCACCGCC
GAGGGGTATA AGGTAAAAAT TTTGACAGGC GATCGCGATT TGTTTCAATT AGTAGATCCC
GAAAAAGAAA TCACTGTCTT AAACTTCAGT CCTGATGCGC TCAAACGTTC TACCAACAGC
ATTACGGAAG TTAGCACTGA ACAAGTTAAA GAAAGATTAG GTGTCTTACC TACTCAGGTT
GTTGATTTTA AAGCCCTCTG TGGCGATAAA TCAGATAATA TTCCCGGTGT GAGAGGTATC
GGGGAAAAAA CCGCAGTTCA GTTACTTAGT ACCTATGGTT CCCTTGATGG AGTGTACAAA
GCATTAAATG AAATTAAAGG CGCGACTCAG AAGAAACTAG CAGAAGGAAA AGAAGATGCG
GCTAAGTCGC AATTTTTGGC AAAGATAGTT GTAGATGTAC CCTTAGAAGT TAATTTAGAA
GATTGTAAAT TAGTAGGATT TGATGAAAGC CTGCTCATTC CTATTTTAGA AAAACTAGAA
TTTACTCGTT TCTTATCCCA AATCAACGAC CTACAGCGAC GCTTTGGTGG CACAGTTGTA
GAAGTAGAAA AAACTGAAGT AGCGGCGATA GAAACCACCA AATTTGAAGA TGATGATATT
GCTTTTTACA CTGCTGAAGA AACACAAGCA ATTCAACAAA ATCAACCAGC TTCACCAATT
CAACCACGGA TTATTAATAC TGAAGCCAAA CTCATAGAGT TGGTGCAGAT ATTACAACAA
TTCACCAACC CAGAAACTCC CGTTGCTTGG GACACGGAAA CTAGCGGCTT AGAACCACGG
GACGCGGCTT TGGTAGGGAT TGGCTGCTGC TGGGGAACTG AAGCCGATGC TAGTGCTTAT
ATTCCCCTGG CTCACACAAA AGGGGAAAAT TTAAGTCAGG ATATTGTCCT TACGGCGCTG
CGTCCGATTT TAGAAAGTGC TGATTATCCG AAGACTTTTC AAAATGCTAA ATTTGACCGT
TTAGTTTTTC TAGTTCAAGG AATTAATTTA ACTGGCGTTG TATTTGATCC GATGTTGGCT
AGTTATGTTT TAAATCCAGA TACTAGCCAT AATTTAACTG ACCTAACTCA GCGTTATTTG
GGGTTAACAA TTCAAAATTA TGTAGATTTA GTTCCTAAAG GTAAAACTAT CGCGGATATA
AATATCTCTG CTGTAGCTAA TTATTGCTGT TTACAAGTTT ATGCCACATT CCAACTTGTA
GCTAAATTGC GGGAAGAACT AGCCAAAACT CCTAATTTGG ATAAATTGCT TAGGGAAGTA
GAACAGCCGC TAGAATCCGT TTTAGCAACA GTAGAATATA CTGGTGTCCG GATTGATTCA
GCTTATCTAC ATGAATTATC CCAACAATTA GAAATAGATT TAGCTAAATT ACAGGAACAA
GCAACTGAAC TAGCTGGGGA AAAATTTAAT TTAGGTTCTC CCAAACAATT GAGTTATATA
TTGTTTGAAA AATTGGGATT AAGTACTAAA TATTCACGGA AGATTCAGAC TGGTTACTCT
ACAGATGCAG CAACTTTAGA GAAACTCCAA GAAGTTGATG AAACAGGCTT TGTTAATGCC
ATTACTGAGT ATCGTACTTT ATCGAAATTA AAATCTACTT ATGTAGATGC TTTGCCAGCA
TTGGTACGTC CAGACACTCA ACGAGTACAT ACAGATTTTA ATCAAACAGC AACGTCAACT
GGTAGGCTAT CTTCATCTAA TCCTAATTTA CAAAACATCC CCATTCGTAC GGCTTTTAGT
CGGCAAATTC GCAAGGCATT TTTACCAAAA TCAGGTTGGT TAATGGCGGC GGCTGATTAC
TCACAAATTG AGTTACGAAT TTTGGCTCAT TTGAGTCAAG AACCGTTATT AGTGCAAGCC
TATCAGCAGA ATGAAGATAT TCATACAGTC ACCGCCAAAT TAGTTTTTGA AAAAGATGAT
ATCAATTCAG ATGAGCGCAG AATAGCCAAA ACTATTAACT TTGGTGTGAT TTATGGTATG
GGTTCTTTAA AATTTTCTCG TTCAACGGGA ATAGATAAAA ATGTAGCGAA CGAATTTATT
AAAAGATTTA ACGAACGCTA CGCCAAAATC TTTGCTTATT TGGAAGGAGT GAAAAAACAA
GCGATATCTC TTGGTTATGT TGAGACGATA CTCGGTCGGC GGCGGTATTT TAATTTTGAT
AGCACAAGTT TGCTTAACTT AAAAGGTAGC AACCCAGAAG ATATTGATTT AAGCAAATTG
AAAAATTTAG GTGCTAAAGA TGCTGGTTTA TTGCGGTCTG CTGCTAATGC CCCTATTCAA
GGTTCTAGTG CTGATATTAT CAAAATCGCA ATGGTGCAGA TTCATGAAGT TCTCAAGAAA
TATCAAGCGC GGTTACTGTT ACAAGTCCAC GATGAATTAG TCTTTGAAGT TCCTCCCCAA
GAATGGGCAG AATTACAACC ACAAATTAAA TCAGTGATGG AAAATGCAGT TAAGTTAAGT
GTGCCGTTAG TGGTTGATGT ACGTGCAGGG GAAAACTGGA TGGAAACGAA GTGA
 
Protein sequence
MSENSPSVTT TTTRPTFILV DGHSLAFRSY FAFAKGRDGG LRTKTGIPTS VCFGFLKSLL 
EVMATQQPQA MAIAFDLGLP TFRHEADDTY KADRPGTPED FVPDLKNLHE LLEGFNLPIY
TAPGYEADDV LGTLAQKATA EGYKVKILTG DRDLFQLVDP EKEITVLNFS PDALKRSTNS
ITEVSTEQVK ERLGVLPTQV VDFKALCGDK SDNIPGVRGI GEKTAVQLLS TYGSLDGVYK
ALNEIKGATQ KKLAEGKEDA AKSQFLAKIV VDVPLEVNLE DCKLVGFDES LLIPILEKLE
FTRFLSQIND LQRRFGGTVV EVEKTEVAAI ETTKFEDDDI AFYTAEETQA IQQNQPASPI
QPRIINTEAK LIELVQILQQ FTNPETPVAW DTETSGLEPR DAALVGIGCC WGTEADASAY
IPLAHTKGEN LSQDIVLTAL RPILESADYP KTFQNAKFDR LVFLVQGINL TGVVFDPMLA
SYVLNPDTSH NLTDLTQRYL GLTIQNYVDL VPKGKTIADI NISAVANYCC LQVYATFQLV
AKLREELAKT PNLDKLLREV EQPLESVLAT VEYTGVRIDS AYLHELSQQL EIDLAKLQEQ
ATELAGEKFN LGSPKQLSYI LFEKLGLSTK YSRKIQTGYS TDAATLEKLQ EVDETGFVNA
ITEYRTLSKL KSTYVDALPA LVRPDTQRVH TDFNQTATST GRLSSSNPNL QNIPIRTAFS
RQIRKAFLPK SGWLMAAADY SQIELRILAH LSQEPLLVQA YQQNEDIHTV TAKLVFEKDD
INSDERRIAK TINFGVIYGM GSLKFSRSTG IDKNVANEFI KRFNERYAKI FAYLEGVKKQ
AISLGYVETI LGRRRYFNFD STSLLNLKGS NPEDIDLSKL KNLGAKDAGL LRSAANAPIQ
GSSADIIKIA MVQIHEVLKK YQARLLLQVH DELVFEVPPQ EWAELQPQIK SVMENAVKLS
VPLVVDVRAG ENWMETK