Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_50610 |
Symbol | |
ID | 7763910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 5127790 |
End bp | 5129100 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643807890 |
Product | General substrate transporter |
Protein accession | YP_002802124 |
Protein GI | 226947051 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.243642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCCG AGCACTCCAG CAGCGCGCCC GTGGCGCGTC CGTTGACCCG CAGCGACCTC AAGACCCTCT CGCTCTCCGC CCTGGGCGGC GCGCTGGAGT TCTACGACTT CATCATCTTC GTGTTCTTCG CCACGGTGGT CGGCAAGCTG TTCTTCCCCG CCGAGATGCC CGACTGGCTG CGCCAGTTGC AGACCTTCGG TATCTTCGCC GCCGGCTACC TGGCGCGCCC GCTGGGCGGC ATCGTCATGG CCCACTTCGG CGACCTGCTC GGGCGCAAGC GCATGTTCAC CCTGAGCATC TTCATGATGG CCGTGCCGAC CCTGTGCATG GGTCTCCTGC CGACCTACGC GCAGATCGGC GTCTGGGCGC CGCTGGCGCT GCTCACCCTG CGCGTGGTGC AGGGCGCGGC GATCGGCGGC GAGGTGCCGG GGGCCTGGGT GTTCGTCGCC GAGCACGCGC CGCAGCGGCA CGTCGGTTTC GCCTGCAGCA CCCTGACCGC CGGGCTGACC ACCGGCATCC TGCTCGGCTC GCTGACCGCC AACGCGATCA ACCGGGCGTT CAGCGCCGAG GAACTGGCCG ACTGGGCCTG GCGCCTCCCC TTCCTGCTCG GCGGGGCCTT CGGCCTGGTT TCGGTCTACC TGCGCCGCTG GCTGCACGAG ACGCCGGTGT TCGCCGAACT GCAACTGCGC CAGTCGCTGG CCGCCGAACT GCCGCTCAAG GCGGTGGTGC GCGAGCACCG TCCGGCGGTG CTGCTGTCGA TGCTGCTGAC CTGGGTGCTG TCGGCCGGCA TCGTGGTGAT CATCCTGATG ACTCCGACCC TGCTGCAGAC GCTGCACGGC TTCGCCGCGG AAGAGGCCCT GCGGGCCAAC GGCCTGGCCA TTCTCGGCCT GACCCTCGGC TGCGTGCTGG CCGGCCTCGC GGCGGACCGC TTCGGCGCCG GGCCGACCTT CGTCTGCGGC GGCCTGCTGC TCCTGGCCAG TTCATCGGCG TTCTACGCCA GCCTCGCCGG CCACCGCGAC TTGATGCTGC CGCTGTACGC CCTGGCCGGC CTCTGCGTGG GCAGCATCGG CGCCATCCCC ATGGTGATGG TCAAGGCCTT CCCGGCGGCG GTGCGCTTCT CCGGGCTGTC GTTCTCCTAC AACCTGGCCT ACGCCATCTG CGGCGGCCTG ACGCCGATCC TGGTCAGCCT GCTGCTGAAG TGGAGCCCGC TGGGCCCGGC CTATTACGTC GGCGCACTGT GCCTGCTGTT CATCCTGACC GGCGCCGGCC TGTGGCGGCG CGGAGCCCCC GCGCTGGCGC CGGCCGGCTG A
|
Protein sequence | MSAEHSSSAP VARPLTRSDL KTLSLSALGG ALEFYDFIIF VFFATVVGKL FFPAEMPDWL RQLQTFGIFA AGYLARPLGG IVMAHFGDLL GRKRMFTLSI FMMAVPTLCM GLLPTYAQIG VWAPLALLTL RVVQGAAIGG EVPGAWVFVA EHAPQRHVGF ACSTLTAGLT TGILLGSLTA NAINRAFSAE ELADWAWRLP FLLGGAFGLV SVYLRRWLHE TPVFAELQLR QSLAAELPLK AVVREHRPAV LLSMLLTWVL SAGIVVIILM TPTLLQTLHG FAAEEALRAN GLAILGLTLG CVLAGLAADR FGAGPTFVCG GLLLLASSSA FYASLAGHRD LMLPLYALAG LCVGSIGAIP MVMVKAFPAA VRFSGLSFSY NLAYAICGGL TPILVSLLLK WSPLGPAYYV GALCLLFILT GAGLWRRGAP ALAPAG
|
| |