Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_42320 |
Symbol | |
ID | 7763108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | - |
Start bp | 4258195 |
End bp | 4259802 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643807083 |
Product | major facilitator transpoter |
Protein accession | YP_002801331 |
Protein GI | 226946258 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.956299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTAC CCGCCACCTC GCCCTGGAGC CCGCTGCGCC AGACGACCTT CCGCTGGCTG TGGCTGGCCA GCATCGCCTC GAACATCGGC ACCTGGATGC ACGAGGTCGG TGCCGGCTGG CTGATGACCA CGCTGTCGGC CAGCCCTTTG AACGTGGCGC TGGTGCAGGT GGCCGGCTCC CTGCCGATGT TCTTCCTGGC CCTGCCGGCC GGTGCCCTGG CGGACATCGT CGACAAGCGC CGCTACCTGC TCGGCGTGCA ACTGTGGATG GCGACGGTGG CCACGCTGCT GGCGGCGCTC ACCCTGCTCG GCCTGACCAG CGTCTGGCTG CTGCTGGCGC TGACCCTGTG CATGGGCATC GGCACGGCGC TGATGATGCC GGCCTGGAGT GCGACCACGC CGGAGCTGGT GGACAAGGAT GAGCTGCCGG CGGCGGTGGC GCTGTCCAGC GTCGGCGTCA ACCTGGCGCG GGCGGTCGGC CCGGCCATCG CCGGGGTGCT GGTCAGCCTG GTCGGTCCCT GGCTGACCTT CGCCCTCAAC GCGCTCTCGT TCTTCGCGGT GATCGCCGTG CTGCTGGCCT GGAGACGCGA GACGGAGCCC GCGGTGCTGC CGGCCGAACG TCTGTTCGGC GCGCTGCGCG CCGGCTGGCG CTACAGCCGC AGCTCGCGGC CGCTGCAGGC GGTGCTGGTG CGCGCGCTGG CGTTCTTCCT CGGCGCCAGC GCCGGCATGT CGCTGCTGCC GCTGATCGTG CGCGGCGAGC TGCAGGGCAG CGCCACCGAC TTCGGCCTGT TGCTCGGCTG CGTCGGCATC GGCGCGGTGC TCGGCGCCGC CTTCCTGCCG CGCCTGCACG AGCGGCTGGG TGGCGACCGC CTGGTGCTGC TGGCCAGCCT GCTCTACGCG CTGGTGCTGA TCGCCCTGGC GCTGCTGCGC GACCTCTACC TGCTGGTCCC GGTGATGCTG CTCAGTGGCG CGGCCTGGAT CGCCGTGCTG TCCAGCCTGC AGGTGGCGGC GCAGACCTCG GTGCCGGGCT GGGTGCGGGC GCGCGCGCTG GCCGTCTACA TCCTGGTGTT CTTCGGCAGC ATGGCTGCCG GCGGCACCCT GTGGGGGCTG GTCGCCAGCC GCGCGTCAAT CCCCTTCGCC CTGCTGTGCG CCGCCGCGCT GCTGGCGCTG GGGCTGCTCG TCGCCCTGCG CTTCCACCTG CCGGTCACCG AGGCGGAGGA TCTGGCGCCC TCGCTGCACT GGCCGGCGCC GATCCTCGCC GAGGGCCTGG ACCGGGAGCG GGGGCCTGTG GTCGTCACCC TGGAATACGA CATCGACCCG CGCAGGGCGG CGGCCTTCCA GCAGGCGATG GAGGAGGTGC GTGGCATGCG CCGGCGCAAC GGGGCGATTT CCTGGTGCCT GGTGCAGGAC AGCGAGAATC CGCGCCAGTG GCTGGAGTTC TTCATCGACG AATCCTGGCT GGAACACCTG CGCCATCATC AGCGGGTGAC CCGTGGCGAA TTGAAAATCG AGGCGGCTGC CCGGCGCTTC CAGACCCCGG GCATCGATAT CCGTATCCGG CACTACCTGA AAGGCCGGCT GGCCCCGGAG CATCCCGGCA GGAAATGA
|
Protein sequence | MSVPATSPWS PLRQTTFRWL WLASIASNIG TWMHEVGAGW LMTTLSASPL NVALVQVAGS LPMFFLALPA GALADIVDKR RYLLGVQLWM ATVATLLAAL TLLGLTSVWL LLALTLCMGI GTALMMPAWS ATTPELVDKD ELPAAVALSS VGVNLARAVG PAIAGVLVSL VGPWLTFALN ALSFFAVIAV LLAWRRETEP AVLPAERLFG ALRAGWRYSR SSRPLQAVLV RALAFFLGAS AGMSLLPLIV RGELQGSATD FGLLLGCVGI GAVLGAAFLP RLHERLGGDR LVLLASLLYA LVLIALALLR DLYLLVPVML LSGAAWIAVL SSLQVAAQTS VPGWVRARAL AVYILVFFGS MAAGGTLWGL VASRASIPFA LLCAAALLAL GLLVALRFHL PVTEAEDLAP SLHWPAPILA EGLDRERGPV VVTLEYDIDP RRAAAFQQAM EEVRGMRRRN GAISWCLVQD SENPRQWLEF FIDESWLEHL RHHQRVTRGE LKIEAAARRF QTPGIDIRIR HYLKGRLAPE HPGRK
|
| |