Gene Noca_3744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3744 
Symbol 
ID4598606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3963347 
End bp3964438 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content71% 
IMG OID639778352 
Productarsenical-resistance protein 
Protein accessionYP_924931 
Protein GI119717966 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.171539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGACA CTGCAGCGGC GCAGGTCCGT GCCGACGTCG TCGGCCGGCT CCCCACGCTC 
GACCGGTACC TGCCGGTGTG GATCGGGCTG GCGATGGCGG CCGGGCTGCT GCTCGGCCGC
TGGGTGCCCG GCATCGCCGA CGTCCTCGAC GCGATCACGA TCGCCTCGGT GTCGCTGCCG
ATCGCGCTCG GGCTGCTGGT GATGATGTAC CCCGTCCTCG CCAAGGTCCG CTACAACGAG
GTGGGCGACG TCGCTCGCGA CACCCGCATG ATGGCGCTGT CGGTGGTGCT GAACTGGGTC
GTCGGTCCGG CCCTGATGTT CACCCTGGCC TGGGTGTTCC TCGCCGACCT GCCCGAGTAC
CGCACCGGGC TGATCATCGT GGGCCTGGCC CGGTGCATCG CGATGGTGAT CATCTGGAAC
GACCTCGCCT GCGGGGACCG GGAAGCGGCG GCCGTCCTGG TCGCCCTGAA CTCGGTCTTC
CAGGTGCTCG CCTTCGCCCT GCTCGGCTGG TTCTACCTCG ACCTGCTGCC CGGCTGGCTC
GGCCTCTCCG GCACCGGGCT GGAGGTGTCG CCCTGGCAGA TCGCCTGGAG CGTGGTGGTG
TTCCTCGGCA TCCCGCTCGC TGCCGGCTAC CTCAGCCGCC GGGCGGGAGA ACGACGCCGC
GGCCGCGAGT GGTACGAGCA GCGGTTCCTG CCACGGATCG GGCCGTGGGC GCTGTACGGA
CTCCTGTTCA CCATCGTGGT GCTGTTCGCA CTGCAGGGCG ACACCATCAC CAACCAGCCG
GCCGACGTCG CGCGCATCGC CGTCCCCCTG GTCGTCTACT TCGCCCTGAT GTGGGGCGGG
TCGATGCTGG CCGCCCACCG TGCGGGGCTC GGCTACCGCC GATCCACCAC GGTCGCCTTC
ACGGCAGCCG GCAACAACTT CGAGCTCGCG ATCGCGGTGG CCATCGCCGT GTACGGCGTC
ACCAGCGGGC AGGCGCTCGC GGGAGTCGTC GGCCCGCTGA TCGAGGTGCC CGTCCTCGTC
GGCCTGGTCT ACGTGAGCCT CTGGGCCCGC CGCTTCTTCC CCGACACCGT CCAGGAGGAC
CTACCCCGAT GA
 
Protein sequence
MSDTAAAQVR ADVVGRLPTL DRYLPVWIGL AMAAGLLLGR WVPGIADVLD AITIASVSLP 
IALGLLVMMY PVLAKVRYNE VGDVARDTRM MALSVVLNWV VGPALMFTLA WVFLADLPEY
RTGLIIVGLA RCIAMVIIWN DLACGDREAA AVLVALNSVF QVLAFALLGW FYLDLLPGWL
GLSGTGLEVS PWQIAWSVVV FLGIPLAAGY LSRRAGERRR GREWYEQRFL PRIGPWALYG
LLFTIVVLFA LQGDTITNQP ADVARIAVPL VVYFALMWGG SMLAAHRAGL GYRRSTTVAF
TAAGNNFELA IAVAIAVYGV TSGQALAGVV GPLIEVPVLV GLVYVSLWAR RFFPDTVQED
LPR