Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_19460 |
Symbol | agaA |
ID | 7760879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 1939192 |
End bp | 1941597 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643804847 |
Product | Agarase |
Protein accession | YP_002799131 |
Protein GI | 226944058 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCGTC CCCTGGTTCC CCTGCTGTTC GCCTGCTGCC TGGTGCCCCT GTCGGCACGG GCGGCCGACG AGCAGACGCT GTTCAACTTC GTGCGCCCCA CCGATGTGGT GCAGGTCAAG GGCGAAGGCG CCTTCCTGCC GGAACTGACC GCGGAGACCA CGGCGGGAGG AGATGTGCTC CGAAGAGTCA CCTTCAATCC GCAGGAGCGG CCCAGCCTGC GTCTGACGCC GCAGCAGGGC AACTGGGACT GGTCTTCCGC CGGCGCGGTC AGCCTGCGCA TCCAGAATGC CATGGACTGG GCGTTGACCC TGCAGGTGAG CATCGAGAGC GCCGACGGCA AGGTTCTCAG GAGTCAGGTC GCCCTGCCGG CCGGTCCGGC GCAGACCCTG GTACTGCCGC TGCGCGTCAG CTCGCCCAGG GCGCATGGCA TGCGCGCCGC GCCGCCGATG CCCTGGACCC ATGAAAACCA GCGCCTGCTG GTGGCGACCA CCCTGGAAGG CGAGATCGAT CCGCGTCAGG TGCAGGCGGT GAGCCTCTCC CTGGAGAGAC CGGACGTGCA GCAGAGCATT CTGCTCGGCC GCTTCGGCGT GCGCGAGGAT CTCGAGCCGG CGGCCTACCG GGGGATCGTC GATGCCTATG GGCAGTACAG CCGCGGCGAC TGGCCGGAGA AGGTGAACAG CGACAAGCAA CTGAAGACGG CGGCCGAGCA GGAGCGGGCG CAGCTCGATC GCTGGCTGGT CGGGCGGCCG GAGCTGGACC GCTTCGGCGG CTGGCTGAAA GGACCGCAAC TGGAGGCCAC GGGCTTCTTC CGGGTGGCCA GGCACGAGGG CCGCTGGTAC CTGGTGACGC CGGACGGCCA TCCGTTCTTT TCTCTCGGCG TGAACACGGT CTCTTCCGGC AACAGCCGGA CCTACGTCGA AGGGCGCGAG GAGATGTTCC TCGCCCTGCC CGGCAAGGAG GAGCCGCTGG GCGCCTTCTA TGGGGCCGGG GACAGCCGCC AGGCCACCGG CGCCAACGAC GGGCGACAGT TCGCCCATGG GCGCTGGTAC GATTTCTATC GGGCGAATCT CTATCGGACC TACGGACAGA GTTGCAAACC GCCCGTGGAG CAGGCCGCCG AGCCGGAGCG CGTGTTGCCG GCAGCTCCCA TGGAAGAGGA ACCGGACAAC GTTCAGGTCG CTCCGGAGGG CGGCACGCCG GCCCCCGGAC AACCGGCTCC CGCCAAGCAG CCGACCGCCG CGCCGCCGTG CGTCGTGCAG TTCTTCGACG CCCTGCGCTG GCGCGGGCAC ACCTTGGACC GCCTGCAGGC CTGGGGTTTC AATACCCTCG GCAACTGGAG CGATCTCTCC CTGGGAGCGA TGCACCGGAT ACCCTACACG ATTCCGCTGC TGATCCGTGG CGACTACGCC ACCATCTCCA CCGGCCACGA CTGGTGGGGC GGCATGCCCG ATCCCTTCGA CCCGCGCTTC GCCATGGCTG TCGAGCGAGC CATCGCCATC GCTACCCGCG ATCACCGGAA CGACCCCTGG GTGATCGGCT ACTTCGCCGA CAACGAGCTC AGTTGGGCCG CGCCGGGGAC GGATCCGAAG GCCCGCTATG CCCTGGCCTA CGGCACCCTG CGGCAGACCA CCGACATGCC TGCCAAGCGC GCCTTCCTCA AGTTGTTGCG CGACCGTTAT CGCAACCAGC AGGGGCTCTC CGCCGCCTGG GGCATCGAAC TGCCCGCCTG GGAGCTGATG GAGGATCCGG GGTTCGAGGC GCCCCTGCCG AGCCCGGAGC ATCCGGCCAT CGAGGAGGAT CTGCAACGCT TCCAGCAGCT CTTCGCCGAC ACTTACTTCA AGACCATCGC CGAGTCGCTG AAGTGGCATG CGCCCGACCA TCTGCTGCTC GGTGGCCGCT TCGCCATTAG CACTCCCGAA GCGGTCGAGG CCTGCGCCAA GTACTGCGAC GTGCTGAGCT TCAACTTCTA CACCCGCGAG CCCCAGCACG GTTATGACTT CGAGGCCTTG CGCAAGCTGG ACAAGCCGAT GCTGGTCAGC GAGTTCCATT TCGGTTCGCG GGATCGCGGC CCGTTCTGGG GTGGCGTGGC CGAGGTGTAC AAGGAAGAAG AGCGCGGCCC GGCCTATGCC CACTTCCTGG AGCGAGCTCT GGCCGAGCCC TTCATCGTTG GCATGCACTG GTTCCAGTAT CTCGACCAGC CGGCGACCGG GCGCCTGCTC GATGGCGAGA ACGGCCATAT CGGTCTGGTC GGCGTCACCG ATCGACCATT CGCCGGTTTC GTCGAGGCGT TGCGCAAGGC CAACCTGAAA GTGGGCAAGG CCTTCGAGCC GGTTGCCACG CCCGCTGCCG GACAGCTGAA GACGGAGGGC GGCGCTCCCG CCGCGGCACC AGCCAGGACA CAGTGA
|
Protein sequence | MIRPLVPLLF ACCLVPLSAR AADEQTLFNF VRPTDVVQVK GEGAFLPELT AETTAGGDVL RRVTFNPQER PSLRLTPQQG NWDWSSAGAV SLRIQNAMDW ALTLQVSIES ADGKVLRSQV ALPAGPAQTL VLPLRVSSPR AHGMRAAPPM PWTHENQRLL VATTLEGEID PRQVQAVSLS LERPDVQQSI LLGRFGVRED LEPAAYRGIV DAYGQYSRGD WPEKVNSDKQ LKTAAEQERA QLDRWLVGRP ELDRFGGWLK GPQLEATGFF RVARHEGRWY LVTPDGHPFF SLGVNTVSSG NSRTYVEGRE EMFLALPGKE EPLGAFYGAG DSRQATGAND GRQFAHGRWY DFYRANLYRT YGQSCKPPVE QAAEPERVLP AAPMEEEPDN VQVAPEGGTP APGQPAPAKQ PTAAPPCVVQ FFDALRWRGH TLDRLQAWGF NTLGNWSDLS LGAMHRIPYT IPLLIRGDYA TISTGHDWWG GMPDPFDPRF AMAVERAIAI ATRDHRNDPW VIGYFADNEL SWAAPGTDPK ARYALAYGTL RQTTDMPAKR AFLKLLRDRY RNQQGLSAAW GIELPAWELM EDPGFEAPLP SPEHPAIEED LQRFQQLFAD TYFKTIAESL KWHAPDHLLL GGRFAISTPE AVEACAKYCD VLSFNFYTRE PQHGYDFEAL RKLDKPMLVS EFHFGSRDRG PFWGGVAEVY KEEERGPAYA HFLERALAEP FIVGMHWFQY LDQPATGRLL DGENGHIGLV GVTDRPFAGF VEALRKANLK VGKAFEPVAT PAAGQLKTEG GAPAAAPART Q
|
| |