Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_31110 |
Symbol | |
ID | 7762011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3216547 |
End bp | 3217629 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643805986 |
Product | ABC transporter substrate binding protein |
Protein accession | YP_002800250 |
Protein GI | 226945177 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.24737 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTCG ACTCCAACCT CTCCCGCCGT CGCCTGCTCG GCCTGGCCGG CACCGCCGCC GCGGCCGCCG CCGTCGGCCG CTTCGCCTGG GCCGCCGATC CGCACGCCCA TGCCGCCCAT GCCGCCCATG GCGCCCATGG CGACGACGCA CAGAATTTCC TGCGCGACGC CAAGAGCTGG GCACTGCCGG CGCCGCGCAA GCTCAAGCTG GCGACCAACC TCAACGCCAT CTGCCTGGCC CCGGTGGCGG TGGCCGACAG CCAGGGCTTC TTCCGCAACC ACAACCTGGA GGTCGAGTTC GTCAACTTCG GCAATTCCAC CGAGGTGCTG CTCGAATCCC TAGCCACCGG CAAGGCGGAT GCCGCCACCG GCATGGCGCT GCGCTGGCTG AAGGCGCTGG AACAGGGCTT CGACGTCAAG CTGACCGCCG GCACTCACGG CGGCTGCCTG CGCCTGATCG CCCAGGAAGG CGGCCCGCGC AGTTTCGAGG AACTCAAGGG CAAGACCATC GGCGTCACCG ACATGGCCAG CCCGGACAAG AACTTCTTCT CGCTGATGCT CAAGCGCCAC GGCGTCGACC CGGTCCGCGA CGTGACCTGG CGGGTCTATC CGATCGACCT GCTCGGCACC GCCCTGGAGA AGGGCGAGGT CCAGGCGGCC AGCGGCTCCG ACCCGATGAT GTACCGCCTG CGCAACCAGC CGGGCAAGCG CGAGCTGTCC AACAACCTGG TCGAGGAGTA CGCCAACCTG AGCTGCTGCG TGGTCGGCGT CGGCGGCAAC CTGGTGCGTA AGGAGCGGCC GGTCGCCGCC GCCGTCACCC ACGCCATCCT GCAGGCCCAC GCCTGGGCGG CGCAGCACCC GGAAACCGTG GCCCAGGACT TCCTCAAGTT CGCGGTCAAC ACCAATTCCG AGGAAATCAA CGCCATCCTC AACGAGCACA CCCACGCGCA CTACTCGGTG GGCAAGGCCT TCGTCGACGA GATCGCCGTC TACGCCCGCG ACCTGAAGGC CGTGGAAGTG CTGCGCGCCA GCACCGATCC CCGGAAATTC GCGGAGAGCA TCCATGCCGA CGTATTCGGT TGA
|
Protein sequence | MTFDSNLSRR RLLGLAGTAA AAAAVGRFAW AADPHAHAAH AAHGAHGDDA QNFLRDAKSW ALPAPRKLKL ATNLNAICLA PVAVADSQGF FRNHNLEVEF VNFGNSTEVL LESLATGKAD AATGMALRWL KALEQGFDVK LTAGTHGGCL RLIAQEGGPR SFEELKGKTI GVTDMASPDK NFFSLMLKRH GVDPVRDVTW RVYPIDLLGT ALEKGEVQAA SGSDPMMYRL RNQPGKRELS NNLVEEYANL SCCVVGVGGN LVRKERPVAA AVTHAILQAH AWAAQHPETV AQDFLKFAVN TNSEEINAIL NEHTHAHYSV GKAFVDEIAV YARDLKAVEV LRASTDPRKF AESIHADVFG
|
| |