Gene Noca_4948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4948 
Symbol 
ID4595324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp283347 
End bp284357 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content68% 
IMG OID639772730 
Productbile acid:sodium symporter 
Protein accessionYP_919390 
Protein GI119714248 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR01593] toxin secretion/phage lysis holin 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value0.805052 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00366916 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGTCAGTAG TGGGTGCCCT GGAGCGGCAC CAGATCTCCA TCTACCTCGG CGGCCTGGCG 
GCTGGAGCTG CCGTCGGACT GGCGTGGCCC GAGAGCTCGC ATCCGCTCGA GCTCGGCATC
TACCCGGTAC TCGGGACCCT GCTCTACGCG ACGTTCCTGC AGGTGCCGTT CACCAAGTTG
GCCGGCGCGT TCCGAGATAC CCGGTTCCTC GCCTCAGCGC TGGTCTTGAA CTTCGCGGTC
GTGCCGCTGG TGGTCGGCGC GCTCACGGCC TTGGTGCCGC TGTCCCAGGC GGTGCTCCTC
GGTGTCCTGC TGACCCTGCT GACGCCGTGC ATCGACTACG TGATCGTGTT CTCCGGGCTC
GCTGGCGGCG ACAGTCAGCG CCTGGTCGCC GCCACGCCGC TGCTGATGCT GGCCCAGCTG
CTGGCCCTGC CGGTCCTGCT GTGGCTGTTC GTAGGCCCTG AGCTGGCCGA CATCGTCGAG
GTCGGGCCGT TCCTAGAGGC GTTCGGGGTC CTGATCGTGC TCCCGCTCGC GTTGGCCTGG
GCCACCGAAG CCCTCGCGGC ACGCCACCGG ACGGGTCAGG CGATCACCGG CGCGATGACC
GCGGCGATGG TTCCCCTGAT GGCCGCCACC TTGTTCGTCG TGGTCGGCAG TCAGGTCCCC
AAGCTCGAGG GTCGGTTCGA CGAGATCATC ACCGTCGTCC CGATCTACGC CGCGTTCCTG
TTGATCATGG CCTTCCTCGG GCTCGCCGCC GCGCGCACCG CTCGACTCGA CACAGGACGC
GCCCGGGCAC TGATCTTCAC CGGCGCCACC CGCAACTCGC TCGTGGTCCT CCCGCTCGCG
CTGGCCCTCC CCGCGGGCTA CGCCATCACC CCGGCCATCG TGGTCACCCA GACCCTCGTC
GAGCTCATCG GGATGCTCGT CTACATCCGA CTCGTTCCGC GGCTGGTCCC GGTAACCTCG
ACACCGAAGA CGGTCAACGA CAGCCGGGAT GGATTGGCTC CAGACGTTTG A
 
Protein sequence
MSVVGALERH QISIYLGGLA AGAAVGLAWP ESSHPLELGI YPVLGTLLYA TFLQVPFTKL 
AGAFRDTRFL ASALVLNFAV VPLVVGALTA LVPLSQAVLL GVLLTLLTPC IDYVIVFSGL
AGGDSQRLVA ATPLLMLAQL LALPVLLWLF VGPELADIVE VGPFLEAFGV LIVLPLALAW
ATEALAARHR TGQAITGAMT AAMVPLMAAT LFVVVGSQVP KLEGRFDEII TVVPIYAAFL
LIMAFLGLAA ARTARLDTGR ARALIFTGAT RNSLVVLPLA LALPAGYAIT PAIVVTQTLV
ELIGMLVYIR LVPRLVPVTS TPKTVNDSRD GLAPDV