Gene Ndas_5210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5210 
Symbol 
ID9249103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp361202 
End bp362506 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content76% 
IMG OID 
Productcopper resistance protein CopC 
Protein accessionYP_003683096 
Protein GI297564123 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.192134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.695526 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGACCCG CTCACGGCAC ACTCCGCCTC CTGCGCACGG CCCTTCTGGC CGCTGCGTGC 
ACGGGGCTTG CCTGGGGCGG GCACAACCTG TGGGCCGCGG CGCCCGCCTC CCCCGGAGGG
CTGGCCGCGG CGACCGCGGT CCTGTTCCCC CTGCTGTGGT ACTTCACCCG CACGATGCGC
GGGTTCGGCG ACATCTTCGC CGTCATGGCC TGCGTGCAGA TCCTCCTGCA CCTGGTGTTC
CAGGGTTCGG GCGAACCCGT CCCCGGGGTC CTCGACACCG GCGCCGACCA CACCGGGCAC
GGCCTGCTCA CCCACACCCT CGGCCTCGCG CCCGGCATGC TGTTCGCGCA CCTGTGGGCG
GCCCTGCTGG CGTCGGCGCT GCTCGCCCAC GGCGAGGCGG CCCTGTGGTT CCTGACCGCC
CTGCTCACCC GGGCCCTGCC GCCGCTGCGC GTGCCCGGCA TCGCGTTCGG CGCCCGCGTA
CCGGTCGCGT GCGCCGCCTC CCGTACCGCT CCCCCGCCCG TCGCGCTCTC GGCGCACGGC
CCGCGCGGAC CTCCCCTCCT CCCGGGCGTC CTTCCACGAC GCCCCCGCGC GCACGCCGGA
ACGTACCGAG AACTGAGGGA AGACATGACC ACCACCCGCA CCCCCGACAC CGCGGCCCCC
GCACGCGCGC TTGCGCGCGC GGCCGCCGCC GTACTCGCCC CCCTCGCCGC GGCCGCCCTG
GCCCTGGCCC CCTCGCCCGC GCTGGCGCAC GACGTCCTGA CCGGGTCCAA CCCGGAGGAC
GGGGCCACCC TGGACACCGT GCCCGAGGAG GTCGTGCTGA GCTTCAACAA CTCGCCGATG
GAGGGCGGCA GCGGCAGCGC CGTCGTCGTC ACGGGGCCGG ACGAGGAGAC CACCTACGAG
GAGGGGGACC TGACCTTCGA CGGCACGGAC GTGTCGGTGG GTCTGGCCCC GCTGGACCAG
GCCGGTGAGT ACACCATCGG GTTCCGCGTG GTCTCCTCCG ACGGCCACCC GATCCAGGAC
ACCCTGACGT TCTCGGTGAC CGAGGAGGCC GTCGCCGCGG CGGCGCCGGA GCCCGAGGAG
AGCGAGACCG CGCAGGAGCC CGCCGGGGAG ACGGCGCAGG AGCAGGCGCA GGGGCCCTCC
GGCGAGGCGA ACGCGGACGA GGCCGCGGCC GAGGAGGAGT CGGGCGGCGT GTCACCGGTG
GCGCTGGCGG TCGTGGCCGT CGTCGCGGTC GCCGGCATCG CCGCGGTCGT GCTCGTGGCC
GTGCGCATGC GCAGGCGCCC GGGCGGGGAC GCCGGGCAGA AGTAG
 
Protein sequence
MRPAHGTLRL LRTALLAAAC TGLAWGGHNL WAAAPASPGG LAAATAVLFP LLWYFTRTMR 
GFGDIFAVMA CVQILLHLVF QGSGEPVPGV LDTGADHTGH GLLTHTLGLA PGMLFAHLWA
ALLASALLAH GEAALWFLTA LLTRALPPLR VPGIAFGARV PVACAASRTA PPPVALSAHG
PRGPPLLPGV LPRRPRAHAG TYRELREDMT TTRTPDTAAP ARALARAAAA VLAPLAAAAL
ALAPSPALAH DVLTGSNPED GATLDTVPEE VVLSFNNSPM EGGSGSAVVV TGPDEETTYE
EGDLTFDGTD VSVGLAPLDQ AGEYTIGFRV VSSDGHPIQD TLTFSVTEEA VAAAAPEPEE
SETAQEPAGE TAQEQAQGPS GEANADEAAA EEESGGVSPV ALAVVAVVAV AGIAAVVLVA
VRMRRRPGGD AGQK