Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_51940 |
Symbol | |
ID | 7764031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5298757 |
End bp | 5300289 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643808010 |
Product | extracellular solute-binding protein |
Protein accession | YP_002802244 |
Protein GI | 226947171 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000130332 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCCGT TCCGCCACCT CTGCCTGCTG TCGCTGCTGG CCTGCCTGGG CGCCTGCGGC CCGAACCAGG ACCCGGAGGT GCTGACCCTC GGCGGTCCCT TCGAATTCAC CAGCCAGGAC CCGGCGCGCG ACGGCTTCGT CTATACCCGC CTGCAGGTGG CCGAGAGCCT GCTGGAGGTG GACGACGCCG GCCGGCTGCT GCCCGGCCTC GCGCAGGGCT GGGCAGTCGA CGACGACGGA CTGACCTGGC ATTTGCGCCT GCGCGAGAAG GTGCGCTTCC ACGACGGCCT GCCGCTGGAC GCCGATGCCG TGGTCCGGGC GCTGGAAATC GCTCGGCGCA AGCCCGGCGT GCTGCGCTCG GCGCCGATCG TCGAGATCCG CGCCGAGGAC CGGCTGGGCG TGGCCATCCG CCTCGCCAGG CCCTACAACC CGCTGGGTGC GCTGCTGGCG CACTATTCGA CGGTCATCCT CTCCCCCGCC TCCTACCGGG ACGGCAGCGA GGTCGGCTGG ATGCAGGGCA CCGGCCCCTA CCGCCTGGAA GCCTTCGATC CGCCGCACCG CATCCGCGTG ACCCGCTTCG ACGGCTACTG GGGCACCCCG GCGCGCATCC CGCAAGCGCT CTACCTCACC GGGCACCGCG CCGAGAGCCG CGCCCTGCAG GTCATGGCCG GGCAGACCGA CATCGTCTAC ACCCTCGACC CCGCCAGCCT GGACCTGCTG CGCCGGCAGA AGGACATCCG CGTGCATTCC GACGCCATCC CCCGCACCAT CCAGATCAAG CTCAACGCCG GCCATCCGTT CCTCGCCGAG CGCGATGCCC GGCTGGCCAT GAGCCTGGCC CTGGACCGCC AGGGCATCGC TAGCCACCTG GTGCGCGTGC CCGGCATGGA AGCCAACCAG TTGATCCCGC CGGCGCTGGC CGACTGGCAC CTCGACGACC TGCCGCCGAT CCGCCGGGAC CCCGAACGCG CGCGGCGACT GCTCGCCGAT CTCGGCTGGC GACCGGGGCC GGACGGCATC CTGCAACGCG CCGGCCAGCG CTTCCGGCTG ACCCTGGTCA CCTACGCCGA CCGCCCCGAA CTGGCGGTGG TCGCCACGGC CATCCAGGCG CAACTGCGCG AGGTCGGCGT CGCCGTCGCC GTGGGCATCG TCAACTCCAG CGGCATCCCT TCCGCCCACC ACGACGGCTC GCTGCAACTG GCCCTGGTGG CGCGCAACTA CGGCAACGTC GCCGATCCCC TGAGCCTGCT GGCCGCCGAT TACGGCGACG GCGGCAATGG CGACTGGGGC GCGATGGGCT GGCGCAACGA GGAATTGCCG GCCCTGCTGA GAGGGCTCGA AGCCGAACGC GACCCGGCGC GCTACCGGGC GGATGCCCGG CGGATCGCGC GCATCCTCGC CGAGGAACTG CCGGTGATCC CGGTGCTCTT CTACACGCAA CAGACGGCCG TCGCGGCCCG CGTGCGGGAT TTCGGCTTCG ACCCCTACGA GCGCAACTAC CGCATTTCCC GGATGAGCTT CGCGAGCCCA TGA
|
Protein sequence | MRPFRHLCLL SLLACLGACG PNQDPEVLTL GGPFEFTSQD PARDGFVYTR LQVAESLLEV DDAGRLLPGL AQGWAVDDDG LTWHLRLREK VRFHDGLPLD ADAVVRALEI ARRKPGVLRS APIVEIRAED RLGVAIRLAR PYNPLGALLA HYSTVILSPA SYRDGSEVGW MQGTGPYRLE AFDPPHRIRV TRFDGYWGTP ARIPQALYLT GHRAESRALQ VMAGQTDIVY TLDPASLDLL RRQKDIRVHS DAIPRTIQIK LNAGHPFLAE RDARLAMSLA LDRQGIASHL VRVPGMEANQ LIPPALADWH LDDLPPIRRD PERARRLLAD LGWRPGPDGI LQRAGQRFRL TLVTYADRPE LAVVATAIQA QLREVGVAVA VGIVNSSGIP SAHHDGSLQL ALVARNYGNV ADPLSLLAAD YGDGGNGDWG AMGWRNEELP ALLRGLEAER DPARYRADAR RIARILAEEL PVIPVLFYTQ QTAVAARVRD FGFDPYERNY RISRMSFASP
|
| |