Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_17170 |
Symbol | |
ID | 7760652 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1699773 |
End bp | 1702478 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643804616 |
Product | CRISPR-associated helicase Cas3, core |
Protein accession | YP_002798905 |
Protein GI | 226943832 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.908442 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACGA CACGCTTTTC CGAACTGCCT GCCTGCAGTC GCGCGCTCTG GGCCAAGAGC GGCGAGCCGG GAGGGCATGG CCTGCTGGCG CACATGCTGG ACGTGGCGGC TGTCGCCGAG ACGCTTCTGG AACGCGAGCC GCCAGCCACG CACGAATGCT TCGCCGCGCA GTTCGGCTTG CCGGAGTCCG TTTTTTCCCG GCATGTCGCC GCGCTGGCGG GGTTGCACGA TTTCGGCAAG GCCATTCCCG GTTTCCAGGC CAAGTGGCCG GCGGGGCGGG AGCGTGACGA GGCGCATGGC CTGGGATTCG CCGAACCTTC CCTGCGCGTC ACCGATCACG CCTGCGCCGG CGCGGCCCTG CTCTGGCGGC ATCTGCCGGC ACAAGGTGCC GAGCCGGGCT GGATCATGGG CGTGCTGCAG GCGATCGGTG CGCACCACGG CTACAACCCC AGCGTTCCGG AAATCCGCAC GGCCGTGCCG GCGCTCGAGG CACCGGGTTG GGCCCCGGCG CGCCAGGCAC TGTTCGAGGC TTACTGGCAG GCCCTGGCCC CCGAAGGCGT ACCCGCGGCC GACGAGTTGC CGCTGCCCGC CGTGGCCTGG CTGGCCGGGC TGACCAGCGT GGCCGACTGG ATCGCATCCG ATCCCGACTG GTTCCCGCCC GGCGAGCGGG AGGATTCGCT CGCCGGGCAT TTCCGCCAGG CCGGGAAGCT CGCCGCCAAG GCTCTGGAAG AGATCGGCTG GAGTCCTCAT CGCGCCCTGT TGCGGGAGCC GGCGACGACG GACGAGTTGC TCGGCCGCAT CGTTCGCCGC GAGGGCGTGG CCGCCCGTTC CCTGCAGGTC GAGGGCGATC GCTTGTTGAG CGAGGCGCGG GGGCCGTCCC TGCTGCTGGT CGAGGCGCCG ATGGGGGAGG GCAAGACCGA GCTGGCCTTT CTCGCCCACC TGCGCCTGCA AGCGGCCAAC GACCATCGCG GCCTGTATGT CGCCCTGCCG ACCCAGGCGA TCGGCAATGC CCTGTTCGAC CGGGCACTGA CCTTCCTGCG CGGCTTCGAC GACCGCCAGC CGCTGGACCG CCAGCCGCTG GATCGCCAGC CGCTGGACAT TCAGTTGGTG CACGGCGGCG CCGCGCTGGA CGAACGGGTA CAGCGCCTGC GCGACATCCA CGGCGGGACG GACGACAGCA TCGCCTCGTC CGCCTGGTTC TCCCAGCGCC GCCGTCCGCT GCTGTCGCCC TATGGCGTCG GCACCGTCGA CCAGGCGCTG TTCGCCGCCC TGAACGTCAA GCACCATTTC GTTCGCCTCT GGGGCCTGGC CAACCGGGTG GTGGTACTGG ACGAGGTGCA TGCCTACGAC GTCTACACCG GCGGTCTGAT CGAGTCCCTG TTGCGCTGGC TCAGGGCGCT CGGTAGCTCG GTGGTGCTGA TGAGCGCGAC CCTGCCCGAA CAGCGGCGAA ATGCCTTGCT GCAAGCCTGG GGCGTGGCGA CGGAGCGGGT GCCGGAGCTG CCCTATCCGC GCCTGCTGCT GGCGGACGGC CGAGGCTTGC GGGCCACGAC CTTCGATTCC CGGCCGTTGC CGCCGATCGC CCTGGAAAGC CTCGACGAGG ACCTGGACGA TCTGGCCGCC TGCGCCCTGC GTCTGCTCGA CGGCGGCGGG CATGGCGCGC TGATCGTTAA CACGGTGGCG CGTGCCCAGG AACTCTACCG CCGCCTGCAG CCGCGGCTCG ACGAAGAGGT GCGGCTGATC CTGTTCCATG CCCGTTTCCC GGCCGACGAG CGCGGCATGC GGGAGCGGCA GGTGCTGGAG GCGTTCGGCA ACGCCGGCAC CGACCGGCCG CGTCCCGGCC GGTGCCTGCT GATCGCCACC CAGGTCGCCG AGCAATCCCT GGACATCGAT TTCGACTTCT TGATCACCGA CCTGGCACCG GTCGACCTGA TCCTGCAGCG CGCCGGACGC CTGCACCGCC ATCGGCGGGC CAGGGTGCCG GCCCATGCCG GGGCCCGGCT GTTCGTCGCC GGACTGGACC CGCGGCGCCT GCCCGGACTG GAGAAGACCG TCTACGAGCC CTACCTCCTC GGTCGCACCT GGGCCCTGTT GTCCCGCGAG ACGGGTCTGC GCTTGCCATG GGACATCGAC CGGCTGGTGC AGGCGGTCTA TGGCGACGCG CCGCTGCCGG ACGACCTGGA TGCGCAGGTA CGGGCGCGTA TCGAAGTCGA AATCCATGGC GAGTATCTCG CCCGTCTGAA AGAGGAGCGG CAGAAGGCGC TGAACATCGC CATCGACCCA TCCGCGGCGC CGCAGAACGC CTATCTCGAC AAGCCGCGCG GCAATGAGGA GGGCGAGGGG CCGGGCCTGA CCAAGCGCAC GCGCCTGGGC GACGCCGCCG TCGCCCTGGT GCCGGTATAC CGGGTGCCGG ACGGCTGGAG CCTGCGGCCG GACGGCGAAG CCTTCGATCC GTCGCTGCCG CTGTCCGATT TCCTGGCCCG CCAACTCCAT GGCCGCCGGA TGAAATGCAG TCACGGGCTT GTCGTGAGTC ACTTTGCCGG CGTCGAGCCT CCGCCATCCT TCGCCGGGCA TCCGTTGCTC GGACATCTGA AACCGCTGTT GCTGGAAGAT GGCTGCCACG TCATCGACAG GCTCGTCCTG CGTCTGGATG AGCACCTGGG ACTGGTCTAT GAGTCGGCCG AGGCCGACAT TCAACCCGAG GACTGA
|
Protein sequence | MTTTRFSELP ACSRALWAKS GEPGGHGLLA HMLDVAAVAE TLLEREPPAT HECFAAQFGL PESVFSRHVA ALAGLHDFGK AIPGFQAKWP AGRERDEAHG LGFAEPSLRV TDHACAGAAL LWRHLPAQGA EPGWIMGVLQ AIGAHHGYNP SVPEIRTAVP ALEAPGWAPA RQALFEAYWQ ALAPEGVPAA DELPLPAVAW LAGLTSVADW IASDPDWFPP GEREDSLAGH FRQAGKLAAK ALEEIGWSPH RALLREPATT DELLGRIVRR EGVAARSLQV EGDRLLSEAR GPSLLLVEAP MGEGKTELAF LAHLRLQAAN DHRGLYVALP TQAIGNALFD RALTFLRGFD DRQPLDRQPL DRQPLDIQLV HGGAALDERV QRLRDIHGGT DDSIASSAWF SQRRRPLLSP YGVGTVDQAL FAALNVKHHF VRLWGLANRV VVLDEVHAYD VYTGGLIESL LRWLRALGSS VVLMSATLPE QRRNALLQAW GVATERVPEL PYPRLLLADG RGLRATTFDS RPLPPIALES LDEDLDDLAA CALRLLDGGG HGALIVNTVA RAQELYRRLQ PRLDEEVRLI LFHARFPADE RGMRERQVLE AFGNAGTDRP RPGRCLLIAT QVAEQSLDID FDFLITDLAP VDLILQRAGR LHRHRRARVP AHAGARLFVA GLDPRRLPGL EKTVYEPYLL GRTWALLSRE TGLRLPWDID RLVQAVYGDA PLPDDLDAQV RARIEVEIHG EYLARLKEER QKALNIAIDP SAAPQNAYLD KPRGNEEGEG PGLTKRTRLG DAAVALVPVY RVPDGWSLRP DGEAFDPSLP LSDFLARQLH GRRMKCSHGL VVSHFAGVEP PPSFAGHPLL GHLKPLLLED GCHVIDRLVL RLDEHLGLVY ESAEADIQPE D
|
| |