Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_40500 |
Symbol | |
ID | 7764132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4100541 |
End bp | 4101746 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643806910 |
Product | Phage integrase |
Protein accession | YP_002801161 |
Protein GI | 226946088 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCTAT CAGACTCAGC CATCCGCACT GCCAAACCGA GAGAAAAACT GTATCGGCTG GCGGACGCCA ACGGCCTATG CCTGGAGGTG ACGCCCACCG GCTCCAAGCT GTGGCGCTAC CGCTACCGCT TCAACGGCAA TGCAAAGATG CTGGCGCTTG GCGTCTACCC TGCCGTCACG CTCCAGAAAG CCCGCCAGCT GCGTGACGCA GCCCGCCAGT TGCTGGCCGA AGGCAAAGAC CCCAGCGCCG AACGCAAGGC CGAGCAGGAA GCCCAGCAGA GTGACGGCCT GACCTTCGAG ACGCTGGCGC GGGAGTGGCA CGCCTACCGG GCGCCACGTT GGGCGAAGTC CACCGCCGAC AAGACCGCCG CCTATCTGGA GTCGGACTTG CTGCCTGTCC TGGGGAAACG TCCGGTGAAG GCAATCACCC GCCCCGAGTT GGTGGAGCTG CTGCACGGGA TCGAGGACCG GGGCGCGCAC AACGTCGCCA AGAAGTGCCG CCAGTGGCTG AGCCAGATTT TCCGCTTCGG GCTGGCCAAG GGCAGCGTCG ACGCCAACCC CGCCACCGAC CTGGACGTAG TGGCAGCCCA TGCGCCGGCC ACCCGCCACC ATCCGCACGT CACCTTTGCC GAGCTGCCCG AGCTGCTGGG CAAGATCGAG GGCGCCAGCA TCAACGTCCT GACCCGCCAC GCCATCCGCC TGCTGGTGCT GACCGGGGTT CGCCCGGGCG AACTGCGCGC CGCCCCCTGG TCAGAGTTCG ACCTCAAGGC CGCCGTCTGG ACGATCCCGA AAGAGCGCAT GAAAGCCCGC CGCCCGCATA TCGTCCCCCT GCCCCGCCAG GCCATGGCGA TCCTGCGCGA GCTGCAAGAG ATCACCGGGG CATATGAGCT GGTCTTTGCA GGCCGCAACA ACAGCGCCCG CCCGATGAGC GAAAACACGG TGAACAAGGC CCTGGCCGAT GCCGGCTACC GGGGCCGCCA GACCGGCCAC GGCTTCCGCC ACCTGCTGAG CACTGAACTG AACAGCCGGG GGTACAACCG GGACTGGATC GAGCGCCAGC TCGCCCACGG CGACCAAGAC GAGATGCGCG ACACCTACAA CCATGCCACC TACCTGGAGC AGCGCCGGGA CATGATGCAA GCCTGGGCCG ACTCGATAGA CGCGCTGTGT GCTGGCGCCA ACGTGGTGAG CATCAAGAGG GCATAA
|
Protein sequence | MPLSDSAIRT AKPREKLYRL ADANGLCLEV TPTGSKLWRY RYRFNGNAKM LALGVYPAVT LQKARQLRDA ARQLLAEGKD PSAERKAEQE AQQSDGLTFE TLAREWHAYR APRWAKSTAD KTAAYLESDL LPVLGKRPVK AITRPELVEL LHGIEDRGAH NVAKKCRQWL SQIFRFGLAK GSVDANPATD LDVVAAHAPA TRHHPHVTFA ELPELLGKIE GASINVLTRH AIRLLVLTGV RPGELRAAPW SEFDLKAAVW TIPKERMKAR RPHIVPLPRQ AMAILRELQE ITGAYELVFA GRNNSARPMS ENTVNKALAD AGYRGRQTGH GFRHLLSTEL NSRGYNRDWI ERQLAHGDQD EMRDTYNHAT YLEQRRDMMQ AWADSIDALC AGANVVSIKR A
|
| |