Gene Noca_3502 details

Gene Information

Locus tag: Noca_3502
Symbol:
ID: 4595601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3710699
End bp: 3712123
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 74%
IMG OID: 639778110
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_924689
Protein GI: 119717724
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
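
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(3710699, 3712123)
print(length)                     # 1425
print(protein_length_aa(length))  # 474
```

Under these assumptions the values reproduce the 1425 bp gene length and 474 aa protein length given in the record.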


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGCTC TCGACGTGGC GGTCGCACCG GCCCCGACGC GGTCCCGACG CTCGCCGGTG 
TGGCTCGCCG TCCTCGCGGC CTCGCTGCCG ATGTTCATGG CCACCCTTGA CAACCTGGTG
ATGACCAGCG CGCTGCCGGT GATCCGGACC GACCTCGGCT CGTCGGTCAA CCAGCTCTCC
TGGTTCATGA ACGCCTATAC GCTGACCTTC GCGACCTTCA TGCTCCCCGC CGCGACCCTC
GGCGACCGGC TCGGCCGACG GCGGATGATG CTCGCGGGCC TGACCGTCTT CACGCTCGCG
TCGGTCGCGT CGGCGCTGAG CACCACCTCG GAGGCGCTGA TCGCGGCGCG CGCCGTGCAG
GGCCTCGGTG CGGCGGCGAT CATGCCGCTC TCCCTGACCC TGCTCGCCTC CGCCGTGCCC
CCGGCCCTGC GCTCGGCCGC CATCGGGATC TGGGGTGGCG TCAGCGGGCT CGGTGTCGCG
CTCGGTCCCG TGGTCGGCGG CGCGGTCGTC GAGGGCGTCA GCTGGCAGGC GATCTTCTGG
CTCAACGTCC CGGTGGCCGC GGTCGCGGGG CCGCTGCTGG TCCTCGGGGT GCGCGAGTCG
CACGGCGCCT GGCAGCGGCT CGACCTGGTC GGGACCCTGC TCGTCGGCGG CGCGGTCCTC
CTCGGGATCT GGGGCATCGT GCACGGCAAC GACGACGGCT GGGGCGATCC GCGGGTCCTC
GGCCCGCTCG TGGTCGCTGC GCTGCTGGCG CCGGCGTACC TGCGCTGGGC CCGAGGTCGT
TCCCACGCGG TCCTGCCGCT GCGGCTGTTC GCCGCGCGTG GGTTCTCCGT CGCGAACGTG
ATCGCCCTGT TCTTCACCAT CGGGATGTTC GGGACGGTCT TCCTGCTCAC GCAGTACCTC
CAGGTGGTCC AGGGCTACAG CCCGCTCGCC GCTGGCGTGC GCACGCTGCC GTGGACGGCC
GCACCGATGG TGGTCGCGCC GCTCGCCGGC CTGCTGGCGC CGCGCACCGG CTTGCGGGCG
CTGCTCCTGA CGGGGCTGGC GCTCCAGACC GGGTCGCTGG TCTGGTTCGC GGTGCTCACC
GAGACCGCTG CGGGCTACCC GGCGTTCATG CCGGCGCTGC TGATGGCCGG GGTCGGGATG
GGGCTGACGT TCGCACCGAT GGCGACTGCC GTGCTCGAGG GCCTGCCCGA GGAGGACTTC
GCCATGGCCA GCTCGGCCAA CTCCACGATC CGCGAGTTCG GGGTCGCGCT CGGCATCGCC
GTGCTCACGG CGGTCTTCCT CGGCAACGGC GGCGCGATCG AGCCGCTCGG GTACGACGGC
GCGATCGGCC CGGCGCTGCT GACCGGCGCC GGGGCCGTGG CGGTTGCGAC ACTCGCCGCG
CTGCTCGCTC CCGGCAGGGG GAGGCGGGCC ACCCCTCGGG CCTGA
 
Protein sequence
MTALDVAVAP APTRSRRSPV WLAVLAASLP MFMATLDNLV MTSALPVIRT DLGSSVNQLS 
WFMNAYTLTF ATFMLPAATL GDRLGRRRMM LAGLTVFTLA SVASALSTTS EALIAARAVQ
GLGAAAIMPL SLTLLASAVP PALRSAAIGI WGGVSGLGVA LGPVVGGAVV EGVSWQAIFW
LNVPVAAVAG PLLVLGVRES HGAWQRLDLV GTLLVGGAVL LGIWGIVHGN DDGWGDPRVL
GPLVVAALLA PAYLRWARGR SHAVLPLRLF AARGFSVANV IALFFTIGMF GTVFLLTQYL
QVVQGYSPLA AGVRTLPWTA APMVVAPLAG LLAPRTGLRA LLLTGLALQT GSLVWFAVLT
ETAAGYPAFM PALLMAGVGM GLTFAPMATA VLEGLPEEDF AMASSANSTI REFGVALGIA
VLTAVFLGNG GAIEPLGYDG AIGPALLTGA GAVAVATLAA LLAPGRGRRA TPRA
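
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method: it builds the codon table for translation table 11 from the standard NCBI base ordering (the amino-acid assignments of table 11 are the same as those of the standard code), strips the spacing used in the listing, and stops at the first stop codon. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 up to (not including) the first stop codon.

    Alternative start codons are not special-cased here; this gene
    begins with ATG, so that simplification does not matter.
    """
    seq = clean(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGACAGCTC TCGACGTGGC GGTCGCACCG GCCCCGACGC GGTCCCGACG CTCGCCGGTG"
print(translate(first_line))  # MTALDVAVAPAPTRSRRSPV
```

The output for the first line of the gene sequence matches the first twenty residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the reported GC content.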