Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3966 |
Symbol | |
ID | 4598101 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4179789 |
End bp | 4183016 |
Gene Length | 3228 bp |
Protein Length | 1075 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639778571 |
Product | major facilitator transporter |
Protein accession | YP_925150 |
Protein GI | 119718185 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTACAG GTGGACGTGG GATGGTCGGT CGGTCACCAG GAGCCGCCGC CCCCGCCGCC GCCTCCGCCG CCACCCGAGA ACCCGCCGCC GCCCGAGGAC GACGACTTCT GGGTGGCCTC GTAGGAGGAG ATCGCGGAGC TGACGGTGGA GTGGAAGTCG TCGACCATCG ACGAGGCCAC CGAGCCGGTG TAGGCGCCCG CGTACGCGGG ACCGAGGTAC GACGGGACCG GCGGCTCGGC GCCCATCTCG GTGCGGTACT TGCCCGCCCA CTCCTCGGCG CACCCGAACG CCACCGCCCA GGGCACGTAG GCGGTGTAGA ACTCCTCGCG TCCGGAGAAG TCGAACCGGT CCTGTGCGGA CGGCGTCGCC AGGATCCGGC GGAACCCGCC GGCCCGCGAC CAGAGGTCGC GGCCGGCCTT GGTGCGCCGG GTGCCCGCGC CGGAGGCCGC GAGGGAGGCG CCGAAGACCC CGAAGGCGCC CGGCACCAGC CCCACGATCG ACATGTTCAG CGGGTTCCAG ATCGCGATCG CGACCACCGC GGCGAGGCAG AGGCCGATGA CCACTCCGCC ACCGCCGCCC AGCCCGCTCC TGACCATCAG GCCGGAGGTG GCGGCCCAGG TCCGGGTGCT CTCCTCGAAC GACTCGATCT CGGTCTTGAG CCGCTGACCG GCCGCGACGT CCTTGGGCGC GGCGACGAAC GAGGTGCCCG GGCCGCCGAG CAGGTGTGCC ACCCCACTGG TCACGGGATC GAGGCCGGCC CAGCCCGCTG CGCCGTTCTT GTCCGAGATC GTCCACGTGC CGTCGGCGCG GTCCAGGTCG ACGGCACCCT TCTCGGCGGC GTACATGAGG GTGGCGACGT ACTGCTCGTC GTCGACCTCC TCGGTGACCA GGTAGGCGGC CTGGGCGGGG CCGATGCCGT CCGGCGGCGC GTACATCACC GGGAACTGCG GGTCCTTCTC CCTGGCCCGC CGGGCCAGGT GGAAGCCGTA GCCGCCGGCC GCGGCGGCCA GGACGAGCAC GACGGCCAGC GCGGCCAGGT TGGGCCCGAG CACGGCGTCC AGCGCCGGGC CCCAGGGTCG CTCGTCGCCG GGCGGAGGCG TTGCCAGGTC CAGGCCCACC TTGACCGTCA CGGGGGTGCG CGGGTCGAGC GACGTCGCCC GGATCCGCAG GTCCGCGGTG CCCTCCCCTC GGAGCCGGCA GCCCGTCTCG TCGGAGCCGA CGGCGCACTG GACCCGCTCG GCCGCGGCGG GCAGGTGGAC CGTGAGGTCG GCCGAGTCGA TGCGCTGGGC CCAGCCGCTG GGGATCAGGT TCCAGTAGAG CTGGGTGCGG GAGCCGTTGG TGCCCGGCTC CAGGATGCCG GCGATGTCGT AGCGGATCTC GTACACGTGC TCGCCGGGCT CGACGGTCGA GTCCGGATCG CCGATGCGGG CGACCTTGAA CTGGCCACCG CCCTCGTAGG ACGTCTCGAC CGGGACGTCG GCACCGTCCA TCGTGACCTC GATGCCGCGG GGGATCCGGC GGACGGTGTC CGGGGCCGTC TGGTCGTGGG TGTCCCAGAA CCGGAAGATC CCGTGCTTGC CGGGGAACGG GAAGTCGACC GTGAGGGTCT CGACGGCCGT CAGGTCGCCC TGGTCGTCCA CGTCGAAGTC CGCGACGTAC GACGTGATCG TGGTCTCGTC CGACTCGGCG GGTCCCTCGT CGCCGCTCAC CCCGTAGAGC GCCGCGGGGA GCATCAGCAC CAGCACCACG ACCGCCAGGC CGACCACGGT CCCCACCACA CGCTTCATGG CCCGGAGCCT AGGGGGCGAC GGGACAGCCG CGTCGGAAAA CCGCGTGCCG GGTGTCGCCG GCCGCTGGGA GACTGCTCGG CGATGAGCTC GCCGACCCGC CCCGACGCCG ACCGCCCGAC CCTCGCCTCC TTCTGGCACG ACCTGCCCCG CGAGGGCCGG CTGCTGCTCT CGGTCGTGGT CTTCGAGTTC ATCGGCACCG GCCTGGTGCT GCCCTTCCAC GTCGTCTACC TGCACGAGGT GCGCGGCTTC GCGCTCAGCG ACGTCGGGCT GCTGCTCGCG CTCCCGCCCC TGATCGGCTT CCTCGTCGTC GGGCCCGGCG GTACCGCGAT CGACCGGCTC GGCGCGCGCC GGATCCTGAT CGGCGCGCTG GTCCTCCAGA CCGTCGCGAA CGTCACACTC GCGTTCTCGG CGGCGGAGTG GATGGCGGCG GGGGGGCTGA TGCTCTCCGG CGCGGCGTTC GGGGTGTCGT GGCCGGGGTT CCAGGCCTTC ATCGCCGCGG TGGTCCCGGT CGAGCTGCGG CAGCGCTACT TCGGCGTGAA CTTCACGCTG CTCAACCTCG GCATCGGGAT CGGCGGCATC GTCGGCGGCG CGTTCGTCGA CGTGGACCGG CTGGTCACCT TCCAGGTCAT CTACCTCGGC GACGCGATCA GCTACCTCCC CGCCCTGGTC CTCCTGCTCT GGCCGCTGCG GCTGGTCGCC GGTCGGCCGG TCCACGAGGG CGGCGCCCCG CCGGCGACGG TGAGCTACCG CGAGGTGATG CGTCGGCCCG CGGTCGCCTC GCTGATGCTG CTCAGCTTCG TGTCGTCGTA CGTCGGCTAC TCCCAGCTCA ACGCCGGGAT GCCGGCGTTC GCGCGCGCGG TGGGCGAGGT CTCGACGCAG GGCCTCGGGC TGGCGTTCGC CGCGAACACC GTGGTGATCG TCGTGCTCCA GCTGGTCGTG CTCCAGCGGA TCGAGGGGCG GCGCCGCACC CGGGTGATCG CGGTGATGTC GGTGGTCTGG GCGTGCTCCT GGGTGCTGCT CGGCGCCACC GGGCTGGTCT CCGGCACGTG GGGCGCGACG CTCCTGGTCG CCGGCTGCGC GTCGGTGTTC GCGTTCGGCG AGACCCTGCT GCAGCCGACC GTCCCCGCCC TCGTCAACGA GCTGGCCCCC GACCACCTGC GGGGGCGCTA CAACGCGCTC AGCTCCGGGT CCTTCCAGCT CGCCGCGATC ATCGCACCGC CGGTCGCCGG CTACCTCGTC GGCCACGGCC TGGGCAGCGT CTACATCGGC TCGCTCGTCG TCGGCTGCCT GCTCTGCGGC GCGCTGGCGG TCCTGCGGGT CGAGCCACAG CTGAGCCCCG AGGTCAACGG TGTGCGAGCT CCGGCGCAGG TCACCACGGC CGCGGACGTC ACCGTCCCCG TGCCGACGGC CAAGACCCAG TCCAGCGCCC TGGACTAG
|
Protein sequence | MGTGGRGMVG RSPGAAAPAA ASAATREPAA ARGRRLLGGL VGGDRGADGG VEVVDHRRGH RAGVGARVRG TEVRRDRRLG AHLGAVLARP LLGAPERHRP GHVGGVELLA SGEVEPVLCG RRRQDPAEPA GPRPEVAAGL GAPGARAGGR EGGAEDPEGA RHQPHDRHVQ RVPDRDRDHR GEAEADDHSA TAAQPAPDHQ AGGGGPGPGA LLERLDLGLE PLTGRDVLGR GDERGARAAE QVCHPTGHGI EAGPARCAVL VRDRPRAVGA VQVDGTLLGG VHEGGDVLLV VDLLGDQVGG LGGADAVRRR VHHRELRVLL PGPPGQVEAV AAGRGGQDEH DGQRGQVGPE HGVQRRAPGS LVAGRRRCQV QAHLDRHGGA RVERRRPDPQ VRGALPSEPA ARLVGADGAL DPLGRGGQVD REVGRVDALG PAAGDQVPVE LGAGAVGARL QDAGDVVADL VHVLAGLDGR VRIADAGDLE LATALVGRLD RDVGTVHRDL DAAGDPADGV RGRLVVGVPE PEDPVLAGER EVDREGLDGR QVALVVHVEV RDVRRDRGLV RLGGSLVAAH PVERRGEHQH QHHDRQADHG PHHTLHGPEP RGRRDSRVGK PRAGCRRPLG DCSAMSSPTR PDADRPTLAS FWHDLPREGR LLLSVVVFEF IGTGLVLPFH VVYLHEVRGF ALSDVGLLLA LPPLIGFLVV GPGGTAIDRL GARRILIGAL VLQTVANVTL AFSAAEWMAA GGLMLSGAAF GVSWPGFQAF IAAVVPVELR QRYFGVNFTL LNLGIGIGGI VGGAFVDVDR LVTFQVIYLG DAISYLPALV LLLWPLRLVA GRPVHEGGAP PATVSYREVM RRPAVASLML LSFVSSYVGY SQLNAGMPAF ARAVGEVSTQ GLGLAFAANT VVIVVLQLVV LQRIEGRRRT RVIAVMSVVW ACSWVLLGAT GLVSGTWGAT LLVAGCASVF AFGETLLQPT VPALVNELAP DHLRGRYNAL SSGSFQLAAI IAPPVAGYLV GHGLGSVYIG SLVVGCLLCG ALAVLRVEPQ LSPEVNGVRA PAQVTTAADV TVPVPTAKTQ SSALD
|
| |