Gene Information |
Locus tag | Avin_33770 |
Symbol | |
ID | 7762272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3452075 |
End bp | 3453301 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643806238 |
Product | hypothetical protein |
Protein accession | YP_002800502 |
Protein GI | 226945429 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
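The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(3452075, 3453301)
print(length, protein_length(length))  # 1227 408
```

Under those assumptions the result agrees with the Gene Length (1227 bp) and Protein Length (408 aa) fields of the record.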
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.756882 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACCG AACACGCCGA CCTCGCAGTC GAGCGCGCTT CGCCATGGCT GCCGATCTGG GCCGGCCTGT GTGCCAGCCT GGTCGGCATC GGTCTTGCCC GCTTCGCCTA TACCTCCCTG GTGTCCGCGC TGATCGAAGC GCACTGGTTC TCCCCATCGG CGGTGATCTA CCTGGGAGCG GCCAATCTCG CCGGCTACCT CATCGGAGCG CTGGGAGGAC GGCCGCTCGC CGCCCGCTGG TCGGACACCG GAGCACTGCG CGCCATGATG GTGCTGGTGG CCCTGTCGTT CTTCGCCTGT GCCGTACCGC TGTCCATGAC CTGGTTTTTC GGCTGGCGGC TGCTGTCCGG CATCGCCGGC GGGGTGATCA TGGTGCTCGT GGCAAGTACC ATCCTGCCGC ATGTCGCGCC GAACCGGCGC GGCCTGGCCA GCGGCGCGAT CTTCCTCGGC GTCGGCCTGG GCATCGCCGC CTCGGGCAGC CTGGTGCCGC TCCTGCTCTC CTTCGGCCTG TCGCAGGCCT GGCTGGGTCT GGGCGGGATA GCGCTGCTGC TGACGGCCAC CAGTTGGTTC GGCTGGCCTC GGGAAAGCCC GCCTGCGCCG CTCATCTCCG GCGCATCCGC CGTGCGCGGC TCGACGGGCG TCTACGTGCT GTTCGCCCAA TATGCGCTGA TGGCCGCCGG GCTGGTGGCG CCGATGATGT TCCTGGTCGA CTACGTCAGC CGCGGCCTCG GCGCCGGCAG CCACAGCGGC GCGCTGATCT GGACGCTGTA CGGGGTGGGC GCCATCGTCG GGCCGGTGGT GTATGGGTTC CTGGCCGACC ACTGGGGCGC GCGCTCGGCG ATTCGCCTGG TGCTGCTGGT CCAGGCCGGT GCTCTCGCGC TGCTGGTGAC GGTCGTCGAC CTGCGCCTGC TGGCCGTCGC GGCGCTGGCG ATAGGTTCGT TTCCGCCCGG CATAGTGCCG CTGGCACTGG CCCGCGTGCG GCAGTTGGTT GCCGGACATC ACCAGCAGAG CGCCACCTGG AGCCGTGCCA CGGTGTCCTT CGCGAGCTTC CAGGCGCTGG CCGGTTATGC CTATTCGGCG CTGTTCTCCG CCAGCCACGG CCACTACACG CTGCTGTTCT CCGTCGCTGC GGGCGCCATC GGTCTGGCCC TGGCGCTGGA TCTGCTGCTG GCGTTCAAGA AAGTCTTCCG CCCCGCCACC GGAGCCGACG CGACACCGGA GGCATAG
|
Protein sequence | MNTEHADLAV ERASPWLPIW AGLCASLVGI GLARFAYTSL VSALIEAHWF SPSAVIYLGA ANLAGYLIGA LGGRPLAARW SDTGALRAMM VLVALSFFAC AVPLSMTWFF GWRLLSGIAG GVIMVLVAST ILPHVAPNRR GLASGAIFLG VGLGIAASGS LVPLLLSFGL SQAWLGLGGI ALLLTATSWF GWPRESPPAP LISGASAVRG STGVYVLFAQ YALMAAGLVA PMMFLVDYVS RGLGAGSHSG ALIWTLYGVG AIVGPVVYGF LADHWGARSA IRLVLLVQAG ALALLVTVVD LRLLAVAALA IGSFPPGIVP LALARVRQLV AGHHQQSATW SRATVSFASF QALAGYAYSA LFSASHGHYT LLFSVAAGAI GLALALDLLL AFKKVFRPAT GADATPEA
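The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch. It assumes translation table 11, as listed in the record, whose amino-acid assignments are the same as the standard code (the two differ in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Alternative start codons are not treated specially here, which does not matter for this gene because it begins with ATG.

```python
# Codon table built from the NCBI-style compact layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGAATACCG AACACGCCGA CCTCGCAGTC"))  # MNTEHADLAV
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.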
|
| |