Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_30280 |
Symbol | hepA |
ID | 7761928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3135773 |
End bp | 3138622 |
Gene Length | 2850 bp |
Protein Length | 949 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805900 |
Product | ATP-dependent helicase HepA |
Protein accession | YP_002800168 |
Protein GI | 226945095 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.561885 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGCAGC AATATCAACC GGGGCAGCGC TGGATCAGCG ACAGCGAGGC GGAACTCGGG CTCGGAACCA TTCTCACCGC GGATGGCCGG TTGCTCACAG TGCTCTATCC GGCCACCGGC GAAACCCGCC AATATGCCCA GCGCAACGCG CCCCTGACCC GCGTGCGCTT CGCCCCCGGC GACGAGATCA CCCACTTCGA CGGCTGGAAG ATGATCGTCC GCGAGGTCGA GGACCAGGGC GGCCTGCTCA TCTACCACGG CCTCGACGCC CAGAACCAGG GGTGCAGCCT GCCGGAAACC CAGTTGTCCA ACTTCATTCA GTTCCGCCTG GCCAGCGACC GCCTGTTCGC CGGGCAGATC GACCCGTTGC CCTGGTTCGG CCTGCGCTAT CACACCCTGG AACACCGCAG CAGCCTGCTG CAGTCCCCGC TCTGGGGTCT GGCCGGCGCC CGCGCCCAGC CGATCGCCCA CCAGTTGCAC ATCGCCCGCG AGGTCGCCGA CCGCGTCAAT CCGCGCGTGC TGCTGGCCGA CGAGGTGGGC CTGGGCAAGA CCATCGAGGC CGGCCTCATC ATCCATCGCC AGTTGCTTTC CGGCCGCGCC GGCCGGGTGC TGATCCTGGT GCCGGAAAAC CTCCAGCACC AGTGGCTGGT GGAGATGCGC CGGCGCTTCA ATCTGGAAGT GGCGCTGTAC GACGCCGAAC GCTTCACCGA GAGCGATGCC AGCAATCCCT TCGAGGACAC CCAACTGGCG CTGGTCTCCC TGGAGTGGCT GACCGTCGCC GAGCACGCCC AGGACGCCGC CTTCGCCGCC GGCTGGGATC TCCTGGTAGT GGACGAGGCG CATCATCTGG TCTGGCACCC CGAGCAGCCG AGCGCCGAAT ACACGCTGGT CGAGCAGCTC GCCCAGGTCA TCCCCGGCGT GCTGCTGCTC ACCGCCACTC CCGAACAGCT CGGCCAGGAG AGCCACTTCG CCCGCCTGCG CCTGCTCGAC CCCGACCGCT TCCACGACCT CGAAGCCTTC CGCGCCGAAA GCGCCAACTA CCGTCCGGTC GCCGAGGCGG TGCAGGAACT GCTCGACGAA GGCCGCCTCT CCGAGCGCGC CCACGCGGTC ATCCGCGGCT TTCTCGGCGC CGAGGGCGAA GCGTTGCTCG CCGCGCTCAG CGATGGCGAC ATCCAGGCCG GCGCGCGCCT GACCCGCGAA CTGCTCGACC GCCACGGCAC CGGCCGGGTG CTGTTCCGCA ACACCCGCGC CGCGGTGCAG GGCTTCCCCG AGCGCCAGTT GCACCCCTAC CCGCTACCGA GCCCGGCCGA GTATCTGGAG CTGCCGATCG GCGAGCATGC CGACCTGTAT CCGGAGGTCA GCTTCCAGGC CCGCCAGGAG GACGACAGCG AAGAGAACCG CTGGTGGCGC TTCGACCCGC GCGTCGAATG GCTGATCGAC ACCCTGAAGA TGCTCAAGAA GGTCAAGGTG CTGGTGATCT GCGCCCACGC CGAGACCGCG CAGGACCTGG AAGACGCCCT GCGGGTGCGC TCCGGCATCC CGGCCACGGT GTTCCACGAG GGCATGAGCA TCCTCGAACG CGACCGCGCG GCGGCCTGGT TCGCCGACGA GGAGTTCGGC GCCCAGGTGC TGATCTGCTC GGAAATCGGC AGCGAGGGCC GCAACTTCCA GTTCGCCCAC CATCTGGTGA TGTTCGACCT GCCGGCCCAT CCGGACCTGC TCGAACAGCG CATCGGCCGC CTCGATCGCA TCGGCCAGAA ACACGTCATC CAGTTGCACG TGCCCTATCT GGAGAACAGC GCCCAGGAAC GCCTGTTCCA CTGGTATCAC CAGGCGCTGA ACGCCTTCCT CGCCACCTGC CCCACCGGCA ATGCCCTGCA GCACCGGTTC GGCCCGCGCC TGCTGCCGCT GCTGGAAGGT GGCGACGACG ACCAATGGCA GGAACTGCTG GATACCGCCC GCGCCGAACG CGAACGCCTG GAGAGCGAAC TGCACGCCGG CCGCGACCGC CTGCTGGAAC TCAACTCCCG CGGCGGCGGC GAAGGCGATG CGCTGGTCGA GGCCATCGAC GAACAGGACG ACCGGTACGC CCTGCCGATC TATATGGAAG AGCTGTTCAA CGCCTTCGGC ATCGACAGCG AGGACCATTC GGAGAACGCG CTGATCCTGC GCCCCAGCGA AAAGATGCTC GACGCCAGCT TCCCACTGGG CAGCGACGAG GCGGTGACCG TCACCTACGA CCGCGCCCAG GCGCTGGCCC GTGAGGACAT GCAGTTCCTC ACCTGGGAGC ATCCCATGGT GCAGGGCGGC ATGGATCTGG TGCTGTCCGG CTCGATGGGC AACACCGCAG TGGCGCTGAT CAAGAACAAG GCGCTCAAGC CCGGCACCGT GCTGCTCGAA CTGCTCTACG TCAGCGAAGT GGTGGCGCCG CGCGCCCTGC AGCTCGGCCG CTTCCTGCCA CCGCTGGCGC TGCGCTGCCT GCTCGACGCC AACGGCAACG ACCTGGCGCC GAAGGTCGCC TTCGACACCC TCAACGACCA GTTGGAAAGC GTGCCGCGCG GCAGCGCCAA CAAGTTCGTG CAGGCCCAGC GCGACGTGCT GGCCAAGCAG ATCAACGCCG CCGAAGCCAA GGTCGCGCCG CGCCATGCCG AGCGCGTCGC CGAGGCCAGG CAGCGCCTCG CGGCCAGCCT CGACGAGGAG TTGGCCCGCC TGACCGCGCT CCGGGCGGTC AACCCGAGTG TGCGCGACAG CGAACTGGAA GCCCTGCGCC GGCAGCGCGA AGATGGCCTG GCCATGCTCG ACAAGGCGGC TCTGCGCCTG GAGGCGATCC GTGTGATGGT CGCGGGCTGA
|
Protein sequence | MVQQYQPGQR WISDSEAELG LGTILTADGR LLTVLYPATG ETRQYAQRNA PLTRVRFAPG DEITHFDGWK MIVREVEDQG GLLIYHGLDA QNQGCSLPET QLSNFIQFRL ASDRLFAGQI DPLPWFGLRY HTLEHRSSLL QSPLWGLAGA RAQPIAHQLH IAREVADRVN PRVLLADEVG LGKTIEAGLI IHRQLLSGRA GRVLILVPEN LQHQWLVEMR RRFNLEVALY DAERFTESDA SNPFEDTQLA LVSLEWLTVA EHAQDAAFAA GWDLLVVDEA HHLVWHPEQP SAEYTLVEQL AQVIPGVLLL TATPEQLGQE SHFARLRLLD PDRFHDLEAF RAESANYRPV AEAVQELLDE GRLSERAHAV IRGFLGAEGE ALLAALSDGD IQAGARLTRE LLDRHGTGRV LFRNTRAAVQ GFPERQLHPY PLPSPAEYLE LPIGEHADLY PEVSFQARQE DDSEENRWWR FDPRVEWLID TLKMLKKVKV LVICAHAETA QDLEDALRVR SGIPATVFHE GMSILERDRA AAWFADEEFG AQVLICSEIG SEGRNFQFAH HLVMFDLPAH PDLLEQRIGR LDRIGQKHVI QLHVPYLENS AQERLFHWYH QALNAFLATC PTGNALQHRF GPRLLPLLEG GDDDQWQELL DTARAERERL ESELHAGRDR LLELNSRGGG EGDALVEAID EQDDRYALPI YMEELFNAFG IDSEDHSENA LILRPSEKML DASFPLGSDE AVTVTYDRAQ ALAREDMQFL TWEHPMVQGG MDLVLSGSMG NTAVALIKNK ALKPGTVLLE LLYVSEVVAP RALQLGRFLP PLALRCLLDA NGNDLAPKVA FDTLNDQLES VPRGSANKFV QAQRDVLAKQ INAAEAKVAP RHAERVAEAR QRLAASLDEE LARLTALRAV NPSVRDSELE ALRRQREDGL AMLDKAALRL EAIRVMVAG
|
| |