Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ava_0581 |
Symbol | |
ID | 3678611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | - |
Start bp | 729107 |
End bp | 732040 |
Gene Length | 2934 bp |
Protein Length | 977 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637715909 |
Product | DNA polymerase I |
Protein accession | YP_321100 |
Protein GI | 75906804 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAA ACTCTCCTTC TGTAACTACA ACTACTACAC GCCCCACGTT CATCCTTGTA GATGGACACT CCCTGGCTTT CCGTTCATAC TTTGCCTTTG CTAAAGGACG GGATGGTGGG CTGCGTACTA AGACAGGTAT TCCTACCAGT GTTTGCTTTG GCTTTCTAAA GTCTTTACTG GAAGTTATGG CTACACAACA ACCGCAAGCA ATGGCGATCG CTTTTGATTT GGGTTTACCA ACTTTCCGCC ACGAAGCCGA CGATACCTAT AAAGCTGATC GCCCCGGAAC GCCAGAGGAT TTTGTCCCTG ATTTAAAAAA CCTCCATGAA TTACTGGAAG GCTTCAACCT ACCAATTTAT ACAGCCCCCG GTTATGAAGC TGATGATGTT TTAGGAACCT TGGCACAAAA AGCCACCGCC GAGGGGTATA AGGTAAAAAT TTTGACAGGC GATCGCGATT TGTTTCAATT AGTAGATCCC GAAAAAGAAA TCACTGTCTT AAACTTCAGT CCTGATGCGC TCAAACGTTC TACCAACAGC ATTACGGAAG TTAGCACTGA ACAAGTTAAA GAAAGATTAG GTGTCTTACC TACTCAGGTT GTTGATTTTA AAGCCCTCTG TGGCGATAAA TCAGATAATA TTCCCGGTGT GAGAGGTATC GGGGAAAAAA CCGCAGTTCA GTTACTTAGT ACCTATGGTT CCCTTGATGG AGTGTACAAA GCATTAAATG AAATTAAAGG CGCGACTCAG AAGAAACTAG CAGAAGGAAA AGAAGATGCG GCTAAGTCGC AATTTTTGGC AAAGATAGTT GTAGATGTAC CCTTAGAAGT TAATTTAGAA GATTGTAAAT TAGTAGGATT TGATGAAAGC CTGCTCATTC CTATTTTAGA AAAACTAGAA TTTACTCGTT TCTTATCCCA AATCAACGAC CTACAGCGAC GCTTTGGTGG CACAGTTGTA GAAGTAGAAA AAACTGAAGT AGCGGCGATA GAAACCACCA AATTTGAAGA TGATGATATT GCTTTTTACA CTGCTGAAGA AACACAAGCA ATTCAACAAA ATCAACCAGC TTCACCAATT CAACCACGGA TTATTAATAC TGAAGCCAAA CTCATAGAGT TGGTGCAGAT ATTACAACAA TTCACCAACC CAGAAACTCC CGTTGCTTGG GACACGGAAA CTAGCGGCTT AGAACCACGG GACGCGGCTT TGGTAGGGAT TGGCTGCTGC TGGGGAACTG AAGCCGATGC TAGTGCTTAT ATTCCCCTGG CTCACACAAA AGGGGAAAAT TTAAGTCAGG ATATTGTCCT TACGGCGCTG CGTCCGATTT TAGAAAGTGC TGATTATCCG AAGACTTTTC AAAATGCTAA ATTTGACCGT TTAGTTTTTC TAGTTCAAGG AATTAATTTA ACTGGCGTTG TATTTGATCC GATGTTGGCT AGTTATGTTT TAAATCCAGA TACTAGCCAT AATTTAACTG ACCTAACTCA GCGTTATTTG GGGTTAACAA TTCAAAATTA TGTAGATTTA GTTCCTAAAG GTAAAACTAT CGCGGATATA AATATCTCTG CTGTAGCTAA TTATTGCTGT TTACAAGTTT ATGCCACATT CCAACTTGTA GCTAAATTGC GGGAAGAACT AGCCAAAACT CCTAATTTGG ATAAATTGCT TAGGGAAGTA GAACAGCCGC TAGAATCCGT TTTAGCAACA GTAGAATATA CTGGTGTCCG GATTGATTCA GCTTATCTAC ATGAATTATC CCAACAATTA GAAATAGATT TAGCTAAATT ACAGGAACAA GCAACTGAAC TAGCTGGGGA AAAATTTAAT TTAGGTTCTC CCAAACAATT GAGTTATATA TTGTTTGAAA AATTGGGATT AAGTACTAAA TATTCACGGA AGATTCAGAC TGGTTACTCT ACAGATGCAG CAACTTTAGA GAAACTCCAA GAAGTTGATG AAACAGGCTT TGTTAATGCC ATTACTGAGT ATCGTACTTT ATCGAAATTA AAATCTACTT ATGTAGATGC TTTGCCAGCA TTGGTACGTC CAGACACTCA ACGAGTACAT ACAGATTTTA ATCAAACAGC AACGTCAACT GGTAGGCTAT CTTCATCTAA TCCTAATTTA CAAAACATCC CCATTCGTAC GGCTTTTAGT CGGCAAATTC GCAAGGCATT TTTACCAAAA TCAGGTTGGT TAATGGCGGC GGCTGATTAC TCACAAATTG AGTTACGAAT TTTGGCTCAT TTGAGTCAAG AACCGTTATT AGTGCAAGCC TATCAGCAGA ATGAAGATAT TCATACAGTC ACCGCCAAAT TAGTTTTTGA AAAAGATGAT ATCAATTCAG ATGAGCGCAG AATAGCCAAA ACTATTAACT TTGGTGTGAT TTATGGTATG GGTTCTTTAA AATTTTCTCG TTCAACGGGA ATAGATAAAA ATGTAGCGAA CGAATTTATT AAAAGATTTA ACGAACGCTA CGCCAAAATC TTTGCTTATT TGGAAGGAGT GAAAAAACAA GCGATATCTC TTGGTTATGT TGAGACGATA CTCGGTCGGC GGCGGTATTT TAATTTTGAT AGCACAAGTT TGCTTAACTT AAAAGGTAGC AACCCAGAAG ATATTGATTT AAGCAAATTG AAAAATTTAG GTGCTAAAGA TGCTGGTTTA TTGCGGTCTG CTGCTAATGC CCCTATTCAA GGTTCTAGTG CTGATATTAT CAAAATCGCA ATGGTGCAGA TTCATGAAGT TCTCAAGAAA TATCAAGCGC GGTTACTGTT ACAAGTCCAC GATGAATTAG TCTTTGAAGT TCCTCCCCAA GAATGGGCAG AATTACAACC ACAAATTAAA TCAGTGATGG AAAATGCAGT TAAGTTAAGT GTGCCGTTAG TGGTTGATGT ACGTGCAGGG GAAAACTGGA TGGAAACGAA GTGA
|
Protein sequence | MSENSPSVTT TTTRPTFILV DGHSLAFRSY FAFAKGRDGG LRTKTGIPTS VCFGFLKSLL EVMATQQPQA MAIAFDLGLP TFRHEADDTY KADRPGTPED FVPDLKNLHE LLEGFNLPIY TAPGYEADDV LGTLAQKATA EGYKVKILTG DRDLFQLVDP EKEITVLNFS PDALKRSTNS ITEVSTEQVK ERLGVLPTQV VDFKALCGDK SDNIPGVRGI GEKTAVQLLS TYGSLDGVYK ALNEIKGATQ KKLAEGKEDA AKSQFLAKIV VDVPLEVNLE DCKLVGFDES LLIPILEKLE FTRFLSQIND LQRRFGGTVV EVEKTEVAAI ETTKFEDDDI AFYTAEETQA IQQNQPASPI QPRIINTEAK LIELVQILQQ FTNPETPVAW DTETSGLEPR DAALVGIGCC WGTEADASAY IPLAHTKGEN LSQDIVLTAL RPILESADYP KTFQNAKFDR LVFLVQGINL TGVVFDPMLA SYVLNPDTSH NLTDLTQRYL GLTIQNYVDL VPKGKTIADI NISAVANYCC LQVYATFQLV AKLREELAKT PNLDKLLREV EQPLESVLAT VEYTGVRIDS AYLHELSQQL EIDLAKLQEQ ATELAGEKFN LGSPKQLSYI LFEKLGLSTK YSRKIQTGYS TDAATLEKLQ EVDETGFVNA ITEYRTLSKL KSTYVDALPA LVRPDTQRVH TDFNQTATST GRLSSSNPNL QNIPIRTAFS RQIRKAFLPK SGWLMAAADY SQIELRILAH LSQEPLLVQA YQQNEDIHTV TAKLVFEKDD INSDERRIAK TINFGVIYGM GSLKFSRSTG IDKNVANEFI KRFNERYAKI FAYLEGVKKQ AISLGYVETI LGRRRYFNFD STSLLNLKGS NPEDIDLSKL KNLGAKDAGL LRSAANAPIQ GSSADIIKIA MVQIHEVLKK YQARLLLQVH DELVFEVPPQ EWAELQPQIK SVMENAVKLS VPLVVDVRAG ENWMETK
|
| |