Gene Information |
Locus tag | Noca_4948 |
Symbol | |
ID | 4595324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | + |
Start bp | 283347 |
End bp | 284357 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639772730 |
Product | bile acid:sodium symporter |
Protein accession | YP_919390 |
Protein GI | 119714248 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR01593] toxin secretion/phage lysis holin |
|
|
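The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 283347, 284357
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1011 336"
```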
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 0.805052 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00366916 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGTCAGTAG TGGGTGCCCT GGAGCGGCAC CAGATCTCCA TCTACCTCGG CGGCCTGGCG GCTGGAGCTG CCGTCGGACT GGCGTGGCCC GAGAGCTCGC ATCCGCTCGA GCTCGGCATC TACCCGGTAC TCGGGACCCT GCTCTACGCG ACGTTCCTGC AGGTGCCGTT CACCAAGTTG GCCGGCGCGT TCCGAGATAC CCGGTTCCTC GCCTCAGCGC TGGTCTTGAA CTTCGCGGTC GTGCCGCTGG TGGTCGGCGC GCTCACGGCC TTGGTGCCGC TGTCCCAGGC GGTGCTCCTC GGTGTCCTGC TGACCCTGCT GACGCCGTGC ATCGACTACG TGATCGTGTT CTCCGGGCTC GCTGGCGGCG ACAGTCAGCG CCTGGTCGCC GCCACGCCGC TGCTGATGCT GGCCCAGCTG CTGGCCCTGC CGGTCCTGCT GTGGCTGTTC GTAGGCCCTG AGCTGGCCGA CATCGTCGAG GTCGGGCCGT TCCTAGAGGC GTTCGGGGTC CTGATCGTGC TCCCGCTCGC GTTGGCCTGG GCCACCGAAG CCCTCGCGGC ACGCCACCGG ACGGGTCAGG CGATCACCGG CGCGATGACC GCGGCGATGG TTCCCCTGAT GGCCGCCACC TTGTTCGTCG TGGTCGGCAG TCAGGTCCCC AAGCTCGAGG GTCGGTTCGA CGAGATCATC ACCGTCGTCC CGATCTACGC CGCGTTCCTG TTGATCATGG CCTTCCTCGG GCTCGCCGCC GCGCGCACCG CTCGACTCGA CACAGGACGC GCCCGGGCAC TGATCTTCAC CGGCGCCACC CGCAACTCGC TCGTGGTCCT CCCGCTCGCG CTGGCCCTCC CCGCGGGCTA CGCCATCACC CCGGCCATCG TGGTCACCCA GACCCTCGTC GAGCTCATCG GGATGCTCGT CTACATCCGA CTCGTTCCGC GGCTGGTCCC GGTAACCTCG ACACCGAAGA CGGTCAACGA CAGCCGGGAT GGATTGGCTC CAGACGTTTG A
|
Protein sequence | MSVVGALERH QISIYLGGLA AGAAVGLAWP ESSHPLELGI YPVLGTLLYA TFLQVPFTKL AGAFRDTRFL ASALVLNFAV VPLVVGALTA LVPLSQAVLL GVLLTLLTPC IDYVIVFSGL AGGDSQRLVA ATPLLMLAQL LALPVLLWLF VGPELADIVE VGPFLEAFGV LIVLPLALAW ATEALAARHR TGQAITGAMT AAMVPLMAAT LFVVVGSQVP KLEGRFDEII TVVPIYAAFL LIMAFLGLAA ARTARLDTGR ARALIFTGAT RNSLVVLPLA LALPAGYAIT PAIVVTQTLV ELIGMLVYIR LVPRLVPVTS TPKTVNDSRD GLAPDV
|
| |
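The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any analysis. The sketch below is a minimal illustration, not the database's own method: it strips the spacing, computes the GC fraction, and translates with the standard amino acid assignments, which translation table 11 shares with table 1. It assumes that an initiating GTG or TTG is read as methionine, which is why the gene begins with GTG while the protein begins with M; only ATG, GTG and TTG are treated as start codons here, a simplification of the full table 11 start set. All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplified start codon set (assumption: the most common bacterial starts).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating GTG/TTG is read as methionine
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("GTGTCAGTAG TGGGTGCCCT GGAGCGGCAC"))  # prints "MSVVGALERH"
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared against the GC content and protein sequence fields of the record.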