Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_43610 |
Symbol | |
ID | 7763234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4405960 |
End bp | 4407510 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643807216 |
Product | ABC transporter |
Protein accession | YP_002801457 |
Protein GI | 226946384 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1129] ABC-type sugar transport system, ATPase component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00213135 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCG CCGAAACGCC GCCGCAGGCG GTCGTCCTGG AACTGCGCGG CATCGTCAAG CGCTTCGGCG CCACCCGCGC GCTGGACGGC GCCAGCCTGC GCGTGCGGCG CGGCAGCGTG CACGGGCTGG TGGGCGAGAA CGGCGCCGGC AAGTCGACGC TGATCAAGGT GCTGGCCGGC ATCCACCGGC CGGACGCCGG CAGCCTGCTG GTCGACGGCC GCGCGCATGC GCATTTCGAT CCGCGCCAGG TGGAGCGCCT GGGCATCCGT TTCATCCACC AGGAGCGCCT GCTGCCGGCC GGCTTCACGG TCGACGAGGC GCTGTTCTTC GGCCAGGAAC GCCGGCTCGG CCCCTGGCTC GACCGCCGCG CCCAGCGGCG CGAGGCCGAG CGCCTGCTGG AGCATTGGTT CGGCCTGCGC CTGCCGGCCG GCGCGCTGGT CGGCGAACTG AGCAGCGCCG AACAGCAGGT GCTGCAGATC GTCCGCGCGC TGATCGTCAA GCCGCGCGTG CTGGTCTTCG ACGAGCCCAG CGTGGCGCTG GTGCGCAGCG AAGTGGAGCG GTTGCTGTGC ATCGTCCGCC GCCTGCGCGA CGAGGGCCTG GCGATCGTCT ACATCTCCCA CTACCTGCAG GAGATCGAGG CGCTCTGCGA CCGGGTGACG GTGCTGCGCA ACGGCCGCGA CGTGGCCGAG GTCGATCCAC GCGCCACTTC GCCGGAGCGG ATCGCCCGGC TGATGGTCAA CCGCGAGGTC GGCGAGCTGT ACCCGAAGAG CGCGCCGGCG CCCGGCGCGC CGCTGCTGGA GGTGCGCGGC CTGGGACGCG GGCGGGCCTA TCGCGACATC GACCTGAGGG TGCGCCGCGG CGAGGTGGTC GGCCTCACCG GGCTGGTCGG TTCCGGCGCC AAGGAGCTCT TGAAGAGCTT GTTCGGCCTG GCCCCGCCGG ATACCGGCGA GGTGCGCCTG GACGGCCGTC CGCTGGCGTT GCGCACGCCA CGCCAGGCGG TGGCCGAGGG CATCGCGCTG CTGCCCGAGG AGCGCCGCCG GCAGGGCGTG GCCCTGGACC TGAGCGTGCA GGAGAACGTC ACCCTGGCGG CACTGCCGCG TTTCTCCCGC TTCGGCCTGC TGTCGCGACG CGCGGAGCGG CGCGAGACGC TCGGCCTGAT CGAGCGGCTG CGGATCAAGA GCGCCGGCCC GCAGGCCACC GTGCGCCAGC TCAGCGGCGG CAACCAGCAG AAGGTGGCGC TGGCCAAGTG GTTCGCCCGC CGTTCCAGCC TGTACCTGCT GGATGAGCCC AGCGTCGGCA TCGATATCGG CGCCAAGGCG GAGATCTACC GGCTGATCGG CGAACTGGCC AGGGAGGGCG CCGGGGTGCT GATCCTCTCG TCCGACCTGC CGGAGCTGCT CGGCCTCTGC GAGCGCATCC ACGTCATGCA CCGCGGGCGG ATCGCCGCCC GTTTCGAGGC CGGCGAGGCG GACAGCGACA AGCTGCTGGC CGTCGCCACC GGCGCCGTCG CACAGCAGGA GACTTCCGTC CATGAATCCA TCCCCGGTTG A
|
Protein sequence | MSAAETPPQA VVLELRGIVK RFGATRALDG ASLRVRRGSV HGLVGENGAG KSTLIKVLAG IHRPDAGSLL VDGRAHAHFD PRQVERLGIR FIHQERLLPA GFTVDEALFF GQERRLGPWL DRRAQRREAE RLLEHWFGLR LPAGALVGEL SSAEQQVLQI VRALIVKPRV LVFDEPSVAL VRSEVERLLC IVRRLRDEGL AIVYISHYLQ EIEALCDRVT VLRNGRDVAE VDPRATSPER IARLMVNREV GELYPKSAPA PGAPLLEVRG LGRGRAYRDI DLRVRRGEVV GLTGLVGSGA KELLKSLFGL APPDTGEVRL DGRPLALRTP RQAVAEGIAL LPEERRRQGV ALDLSVQENV TLAALPRFSR FGLLSRRAER RETLGLIERL RIKSAGPQAT VRQLSGGNQQ KVALAKWFAR RSSLYLLDEP SVGIDIGAKA EIYRLIGELA REGAGVLILS SDLPELLGLC ERIHVMHRGR IAARFEAGEA DSDKLLAVAT GAVAQQETSV HESIPG
|
| |