Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4340 |
Symbol | |
ID | 4596858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4587365 |
End bp | 4588807 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 639778950 |
Product | major facilitator transporter |
Protein accession | YP_925524 |
Protein GI | 119718559 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.199167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCGAGC ACGTGGTGGC GCCACCGCGG GAGGAGGAGC CGGCCTACCA GCCGCACCCG CGGCGCTGGC GCATCCTCGG CGTCACGCTG GTCGTCGGCT TCATGGCGCT GCTCGACGTC TCGATCGTCA ACGTCGCGAT CCCGTCGATG CGCATCGGCC TGGAGACGTC CGCCGGGACC GTGCAGTGGG TCGTGTCCGG CTACGCGTTG GCGTTCGGCC TGACCCTGGT CGCCGGCGGC CGGCTGGGCG ACGCGTACGG CCGCCGCCGG ATGATGCTGA TCGGCCTGCT CTGCTTCGTC GCCTCGAGCG TCGCGGTCGG GTTCGCGCCC GACGTCGAGC TCGTGATCGC CGCCCGGCTG CTCCAGGGCG CGTCCGCCGG GCTGCTGACC CCGCAGAACT CCGGGCTGAT CCAGGAGCTC TTCCGCGACC GCGAGCGGGC GCGGGCGTTC GGCCTGTTCG GCTTCACCGT GTCGGTCTCG TCGGCGACCG GCCCCGTGCT CGGCGGCCTG ATCATCGCGC TGGCCGGCGA GGACGACGGC TGGCGCTGGC TGTTCTGGGT GAACGCACCG ATCGGCCTGG TCGCGATGGT CGCCGTGGCG CGGCTGGTGC CGGGCCCCGA CCCCGACGCG GCGGGGCGCG GCAGCAGCCG GGTCGACGTC CCCGGGGCGG TGCTGCTCGG TGGCGCGGTG CTGTGCCTGC TCTACCCCGT GGTGCGCATC GAGAGCGGCG CCCGGCTGCC GCTGCTGCTG CTCGCCGGGG TGCCGCTGTT CGCGTGGGCG TTCGTGTGGT GGGAGCGGCG TACGGCGCTG CGCGGCCGGC CGCCGCTGCT CGACATCGCG CTGCTGCGCC GGCTGCCCGG CTACCTCAAC GGGCTGCTGG TCGGGGCGCT GTACTTCACC GGGTTCACCG GCCTGCTGCT GGTGTTCTCG ATCTACCTCC AGGAGGGCCT CGGCCGCACC CCGCTCCAGG CCGGGCTGCT GATGATGCCG TTCGCCGTCG GGTCGGCGAT CAGCGCGCCG ATCGCCGGTC GCTACGTCTC GGACGCGGGC CGACGGCTCA CGGTCGGCGC GATCCTCGCG ATGATGACCG GCGTCGCCCT CCTGGCCGCC CTGGTCCCGG GCCGCGAGCC GCTGTGGCCG TGGCTGGTGC CGATCCTGCT GCTCACCGGC CTCGGCGGCG GCGCGGTCGT CTCGCCGAAC ATCACGCTGA CCCTGACCGA GGTGCCCCCG CGGATGGGCG GTGCGGCCGG TGGCGCGCTG CAGACCGGCC AGCGGATCGG GGCGTCGATC GGCGCCGCGC TGCTGGTGAC CGTCTACGGG CTCGTCGCCG GGTCGGCCAG CACCGGCACC GCGCTCCGCG CCGCGCTGCT CACGAGCGTG GTGGTGCTGT GCGCCGCGCT GGCCATGGCG GTGCGCGCCC TGCGCCAGCA GGACGCAGGC TGA
|
Protein sequence | MSEHVVAPPR EEEPAYQPHP RRWRILGVTL VVGFMALLDV SIVNVAIPSM RIGLETSAGT VQWVVSGYAL AFGLTLVAGG RLGDAYGRRR MMLIGLLCFV ASSVAVGFAP DVELVIAARL LQGASAGLLT PQNSGLIQEL FRDRERARAF GLFGFTVSVS SATGPVLGGL IIALAGEDDG WRWLFWVNAP IGLVAMVAVA RLVPGPDPDA AGRGSSRVDV PGAVLLGGAV LCLLYPVVRI ESGARLPLLL LAGVPLFAWA FVWWERRTAL RGRPPLLDIA LLRRLPGYLN GLLVGALYFT GFTGLLLVFS IYLQEGLGRT PLQAGLLMMP FAVGSAISAP IAGRYVSDAG RRLTVGAILA MMTGVALLAA LVPGREPLWP WLVPILLLTG LGGGAVVSPN ITLTLTEVPP RMGGAAGGAL QTGQRIGASI GAALLVTVYG LVAGSASTGT ALRAALLTSV VVLCAALAMA VRALRQQDAG
|
| |