Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50320 |
Symbol | |
ID | 7763882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 5101182 |
End bp | 5102129 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643807862 |
Product | ABC transporter substrate binding protein |
Protein accession | YP_002802096 |
Protein GI | 226947023 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.193489 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGTCG CCAGTTCTTC CCGGCCCCTG AGCCGTACGC GGGCTGTCCT GTTCGCAACC GTCCTGTGCC TGGCCGGCTC CACGGCGCTC GCCGATCTCC GGCTCGGCGT CAGCGTCGGC CAGTTCGACA ACTACATCGC TTATCTGGTG CGCGCCATGC AGGAGCGTGC CCGGCAGGTA CCGGGCGGCG TCACGCTGCA GGTCGAGGAC TCCGGCAGCG ATGTGGTCCG CCAGCTCGGC CAGGTCGAAA GCTTCATCGC CCAGCAGGTC GACGCGATCA TCGTCAACCC CGCCGATACC GCCGCCACCG GCGGGATCAC CGAGCGCGCC ACGCGGGCGG GCATTCCGCT GGTGTATCTC AACAGCCGCC CGGAGGTGCG TGAGTTCCCC GCGGGCGTGG TCTTCGTCGG CACCGACGAG CGCCGCCTGG GCCAGATGCA GATGGAGTAC CTGGCCGAGA AGATGGGCGG CAAGGGCGAC CTCGCCATCC TCCTCGGCCG CCTGGCCCAC GACGACACGC GCAAGCGCAC CGCCGGTGTG AAGGACGTGC TGGCCCGCTA CCCGCAGATC CGGGTGGTGG AGGAGCAGTC CGGCGACTGG CAGCGCGACA AGGGCCTGGA TCTGACCAAC AACTGGCTAT CGTCCGGTCG CGAGTTCGAC GCGGTGGTGG CCAACAACGA CGAGATGGGC ATCGGCGCCG CCATGGCGCT GCGCCAGGCC GGGCGCCGGG AGGTGTCGGT CGGCGGTATC GACGGCACGC CCGACGGCCT GGCCGCCATC GCTCGCGGAC AACTGGCCGT GACCCTGCTG CGCGACCCGG TCGCCATGGG CGAAGAGGCC GTGGACGTGG CCCTGCGACT GGTCCGCAAG GACGTCGTGC AAGGCGACGT CTGGATACCG GTTCACCTGA TAACCCCCGA CAACCACACG CAATTCCAGC GCTATTGA
|
Protein sequence | MHVASSSRPL SRTRAVLFAT VLCLAGSTAL ADLRLGVSVG QFDNYIAYLV RAMQERARQV PGGVTLQVED SGSDVVRQLG QVESFIAQQV DAIIVNPADT AATGGITERA TRAGIPLVYL NSRPEVREFP AGVVFVGTDE RRLGQMQMEY LAEKMGGKGD LAILLGRLAH DDTRKRTAGV KDVLARYPQI RVVEEQSGDW QRDKGLDLTN NWLSSGREFD AVVANNDEMG IGAAMALRQA GRREVSVGGI DGTPDGLAAI ARGQLAVTLL RDPVAMGEEA VDVALRLVRK DVVQGDVWIP VHLITPDNHT QFQRY
|
| |