Gene Noca_2172 details

Gene Information

Locus tag: Noca_2172
Symbol:
ID: 4599079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2324513
End bp: 2326867
Gene Length: 2355 bp
Protein Length: 784 aa
Translation table: 11
GC content: 67%
IMG OID: 639776774
Product: MMPL domain-containing protein
Protein accession: YP_923367
Protein GI: 119716402
COG category: [R] General function prediction only
COG ID: [COG2409] Predicted drug exporters of the RND superfamily
TIGRFAM ID:
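
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length counts the stop codon, which is not translated. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2324513, 2326867)
print(length, expected_protein_length_aa(length))
```

Under those assumptions the record is internally consistent: the coordinates span 2355 bp, and 2355 / 3 - 1 gives 784 aa.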


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAACCT TCCTGTACCG GCTCGGAAGA ACCGCGTTCG GCAAACCGTG GCTGTTCGTC 
GCCGGCTGGG TCGCAGTCCT CGCGGTGGTC GTTGGTGGCA TGGCAATCAA CGGGGTAAGC
GTCAGCTCCG AGATGAAGAT CGAGGGCACC GAGGCCCAGA CCGTGCTCGA CCGCGTGGCC
GATGAGCTCC CCGAGGCCTC GGGAGGCCAG GCCAGCGTGG TCTTCACCGT GCCGGACGGC
GAGCGCCTCG ACACTCCGGA GCGACTCGCG GTCATCAGCG GCACCGTCAG CGACGTCTAT
GACCTCGAGA AGGTCGTCAA CCCCCTCGAC GCCGCGCTGG GTGCCGCCGA GCAGGGAGGA
CCGGGCACCC CCCAGGAGAA TGCACCGGGC GATCCCCCAG CCGGGTCGGA CCAAGAACCG
GCACAGGAGC AGGGGCCTCC GTACCAGCCG CTGCTGGTGG ACGGGGAACC GGTGCCCGGC
GTGCTGGTGT CCTCGGACGG GCAGGTCGCG CTGTTCCAGT TTCAATTCAC GGTCGCCGCA
ACCTCACTGA CAGATGACGA CGTCACCTCG GTGGTCGAGG TGGTGGAACG CGCCGAGCAG
GGAACGGGGA TGACCGTTCT ACCGAGCGAC TCGCTCAAGG CCCTCGAGAT CCCCATCGGC
ATCGGCGAGG TGATCGGTCT CGCCGTCGCC GCCCTCGTGC TGGTGCTCAC CCTGGGTTCC
CTTGTCGCCG CCGGCCTGCC CCTGATCACC GCACTGGTCG GCGTCGGCAT CGGCGTGGGC
GGCGCATACG CGCTCTCGAA CGTCGTCGAG ATGAACTCCG CCACTCCCGT CCTCGGCCTC
ATGGTCGGGC TCGCCGTCGG CATCGACTAC GCACTGTTCG TCGTCAACCG GCAGCGACGA
CTGATCCTCG ACCAGGGACT CACCGCTCAG GAGGCAGCCG GCAGAGCAGT CGGCACCGCG
GGCAGTGCCG TGTTCTTCGC CGGCCTGACC GTCCTCATCG CGCTCACCGC GCTCACCGTC
ATCGGCATCG CAGTGCTGTC CACGATGGCA CTGGTCGCCG CGTCCACGGT GGCCCTGGCC
GTCCTCATCG CCCTGAGCCT TCTGCCGGCG CTGCTCGGTC TGGTCGGGGA GCGGATCTGC
TCCGACAAAG CCCGAGCCCG ACGCCGCACC AAGGTCGAGG CCGAGTCGCA CGGCGTCGCT
GACCATTGGG TCAAGGGTGT GATCAGGTTC AGGTGGCCTG TCATCGCCGG TGTGGTCGCG
ATCCTGGGCG TGATGGCGAT CCCCGCAGCC AGCATGCACC TGGGGATCCC TTCCGGCGCG
ACCGCCAACC AGGACACAGC CGCCCGCCAG AGCTACGAGG CAGTGTCCCA AGGCTTCGGC
GAGGGATTCA ACGGCCCCCT CCTGGTCACC GCCGAGCCCG TCGGCACCTC AGGCCGCGTC
ACGCCCGAGC TGACCGCGAA ACTAATCGGC GAGTTCCAGG ACCGAGGCGA CATTGTGCTG
GCCGCCCCCG TTGGCGTCAA CGAGGCTGGC GACCTGGCTG TGTTCAGCAT CATCCCCGCC
TCCGGACCCG ATGACGAGGC CACCAGCGAC CTCGTCAAAT CGCTACGCGA GCCCGGCAAC
GCCATCGCCC AGCGCAACCA GGTGCAGTTG GGCGTGACCG GGTTCACCGC CATCCGCATC
GACATGTCCG ACAAGATCGC CGGCGTTCTT CCCCTCTATC TCGGCATCAT CATCATCCTT
TCCATCCTGA TCCTGATGCT GGTCTTCCGC TCGGTCGTGG TCCCAATCAA GGCCACAGCG
GGCTTCCTGC TCAGCATCCT GGCCACCTTC GGTGCCACCA CTGCCGTCTT CCAGTGGGGC
TGGCTCAGCG GCCTCTTCGG GTTCGACACC GGCGGCCCGC TGATGAGCTT CATGCCGATC
ATCGTGACCG GCATCCTCTA CGGACTCGCC ATGGATTACG AGGTCTTCCT GGTCTCCTCG
ATGCGCGAGG CGCACATCCA CGGCCAAGCA GCCCGCCAGA GCGTCGTCCA CGGGTTCGAC
CAGGCCAGCC GGGTCGTGGT CGCAGCCGCC ATCATCATGG TCGCAGTGTT CTCCGGCTTC
ATCTTCAGCC ACGACATCAT GATCAAGCAG ATCGGCTTCG CCCTCGCCGC CGGCATCCTC
ATCGACGCCT TCCTCGTCCG GCTGACCCTC GTCCCGGCGC TCATGGCCGC CTTCGACGAG
CGAGCATGGT GGCTGCCCCG CTGGCTCGAC CACCTACTGC CGGACCTCGA CATCGAGGGC
GACAAGCTCT TGGCCATGCT CAACCAGCAG GCCGAACCCA CCGACCGAGA AGACAACGAC
ATCCGCAGCC GATGA
 
Protein sequence
MSTFLYRLGR TAFGKPWLFV AGWVAVLAVV VGGMAINGVS VSSEMKIEGT EAQTVLDRVA 
DELPEASGGQ ASVVFTVPDG ERLDTPERLA VISGTVSDVY DLEKVVNPLD AALGAAEQGG
PGTPQENAPG DPPAGSDQEP AQEQGPPYQP LLVDGEPVPG VLVSSDGQVA LFQFQFTVAA
TSLTDDDVTS VVEVVERAEQ GTGMTVLPSD SLKALEIPIG IGEVIGLAVA ALVLVLTLGS
LVAAGLPLIT ALVGVGIGVG GAYALSNVVE MNSATPVLGL MVGLAVGIDY ALFVVNRQRR
LILDQGLTAQ EAAGRAVGTA GSAVFFAGLT VLIALTALTV IGIAVLSTMA LVAASTVALA
VLIALSLLPA LLGLVGERIC SDKARARRRT KVEAESHGVA DHWVKGVIRF RWPVIAGVVA
ILGVMAIPAA SMHLGIPSGA TANQDTAARQ SYEAVSQGFG EGFNGPLLVT AEPVGTSGRV
TPELTAKLIG EFQDRGDIVL AAPVGVNEAG DLAVFSIIPA SGPDDEATSD LVKSLREPGN
AIAQRNQVQL GVTGFTAIRI DMSDKIAGVL PLYLGIIIIL SILILMLVFR SVVVPIKATA
GFLLSILATF GATTAVFQWG WLSGLFGFDT GGPLMSFMPI IVTGILYGLA MDYEVFLVSS
MREAHIHGQA ARQSVVHGFD QASRVVVAAA IIMVAVFSGF IFSHDIMIKQ IGFALAAGIL
IDAFLVRLTL VPALMAAFDE RAWWLPRWLD HLLPDLDIEG DKLLAMLNQQ AEPTDREDND
IRSR
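
The gene and protein sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any length or composition check. The following is a minimal sketch of that clean-up plus two simple checks (GC fraction and start/stop codons). The helper names are hypothetical. The start-codon set lists only the common bacterial starts; translation table 11 also permits rarer ones.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(
    seq: str,
    starts: tuple = ("ATG", "GTG", "TTG"),  # common starts only (assumption)
    stops: tuple = ("TAA", "TAG", "TGA"),
) -> bool:
    """True if the length is a multiple of 3 and the sequence begins and ends with allowed codons."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# Example using the first block-formatted line of the gene sequence.
first_line = "ATGTCAACCT TCCTGTACCG GCTCGGAAGA ACCGCGTTCG GCAAACCGTG GCTGTTCGTC"
cleaned = join_blocks(first_line)
print(len(cleaned), round(gc_fraction(cleaned), 2))
```

Applied to the full gene sequence, `len(join_blocks(...))` can be compared with the Gene Length field and `gc_fraction` with the GC content field. The same `join_blocks` helper works for the protein sequence before comparing its length with the Protein Length field.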