Gene Ndas_3295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3295 
Symbol 
ID9247157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3929602 
End bp3931671 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content79% 
IMG OID 
ProductTetratricopeptide TPR_4 
Protein accessionYP_003681207 
Protein GI297562233 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCCTGC GCGCCCAGCC CTCCCTCCTC TCCCCCGAGT CCCCCCTCTC CGCCGCCTGC 
CCGGACCACC ACGCGCTGCG GTCCGGCAGG TCCCTGCTCG CCCGCGTGAG CCACGGAGTG
CGGCTGGCGC AGGGTGGCAT GTTCGAGCAG ACCGCGGCCC TGCTGGACGA GGCCCATCGG
CGCCTGCTCG GCCACCCGGT GGCCGAGTCC GCGACGGTGA TGCCCGGCGT CCTGCTCAAC
CTGGGACTGG CGCAGACGCT GTGCGGCCGC TTCCCGCAGG CCGAGGAACA CCTGCGCGAG
GCCCGGTCCC TGGCACGGGA GCGCCGTCTG CCGCTGATGG GCCTGGTCAT CGACCAGAAC
CTGGGCTGCC TCAGCCTCTA CCGGGGCGAC GCCCCCGCGG CCATCGCCAC CTTCCACGGC
CTGACCGACC GTCTGCCCTC CGAACGCCGT GAGGCGCTGC ACGTGGACCT GGCCGAGGCG
CTGCTGGCCG AGGGCCTGGT GGAGGAGGCG GCCACGGCGC TCGCGGACGG CCCCTGGGCC
GGGGGTTCGT CGGCCAACGC CCTGCTGGTG GAGGCCAAAC TGCGGCTGCT GCGGGGCGAC
CACCGCCACA CGGTCGAGCT CACCCGCCGG GTGCGGCACT CCTTCGGTAC GGGCTCGCTG
TGGTACCGGC TGGCCACCCG CCTGGAGAGG ATGGCGCTGC GCGAGGGCCG GACGACCTCC
CCCGTGGCCC GGGCCAGGAC CGCCCTGGCC GTCCGCGCGC CCCTGGCCGC GCCGCGACCG
TCGGGGAAGG GTGCCGCCCA CGTGGCCCTG GACACCCTGG ACGCGCACGC CCGCCTCGCG
CCGGGTCCCT GGTTGGCGGG CGCGGTGAGC GATCCCCACG TCGTCCGCGC GGGCCTGGAG
AGCGCCCTGC GCGCCGGAGA GGCGGCCACC GCCCTGGAGT GGGCGGAACT GGCCCGGACC
TGGGCCGCGC CCCACGTCCC CGGTCCCGGC GTACGCACAC CCGCGACGGC CTCGCTGGCC
GACCGCTACC GTGCGGCGCT GGTCCGCGGC CGCGACCCGC ACGTCCTGGC CCGGCGCCTG
GAGTCGGCCC GCTGGCAGGC GCACCACCAG ACCCCGGGCG CGCGCCGCGC CCCCGCACCC
GCGCCCGTGG CGGGCGCGCT GCTGGAGCGG CTGGGCGACC GGGCCTTCGT GCGGTACACG
CGGGCGGAAG GGAAGGCCGT CGCGCTCGTG GCCGCGGCCG GACGGGTCCA CGCCCGCGTC
CTGGGGCCGC TCCCGCGGGT GGCCCGTGCG ATGGCCCGGT TCGTCCACCC GGTGGCGTTC
GCACAGGACG GCTGCGAGGC GCGGGAGGCG GCCGAGGACG TCGCACGTGT CCTCCTCGAA
CCGCTGCTGG CGCTGGTCGG CGACCTTCCG CTGGTCGTGG CCGGCGACTC CTACCTCGGC
GATCCCCCCT GGGGCATGCT CCCCGCGCTG CGGGGCCGTC CGCTGAACCT GGTGCCCGGC
GCCGGGTTCT GGCTGGACCG GACCCGCTCC GGGGTACCCG TGCCCCGCCC GCCGGAGCGG
GTGCTGCTGG TGGCGGCTCC CGAACCGGAC GGGGCGGCGC GCGAGGTCGC CGCGCTGGCC
GACGTGTACC CGGGGGCGCG CGTGCTCCGT GCCGACCGGG CCCTGCGCTC CGACGTCCTG
GCCTCCCTCG GCCGGGCCGA CCTGGTCCAC CTGGCGGGGC ACGGGCGGGT CCCCGGCCGC
TCGCCCATGC TCGCCTCGGT CGGCCTGGGC GACGGGCCGC TGCTCGCCTG CGACCTGGCC
GGGCTGCCCG AGGCCCCGGA CACGGTGGTG CTGTCGACCT GCTGGAGCGG CCGGGGCTTC
GCGGACCGCG CCGGGGCTCC GCTCGGGTTC ACCGGCGCGC TGCTCGCGGC CGGGGTCCGC
ACGGTGGTGG CCAGCCCCGT CCCCGTCCGG GACGCGGGGA CCGCCGGGGC CATGCGGCTG
TTCCACCGCG CGCTCGCCGC GGGCGTCCCC GCGCCCGAGG CGGTGGCGGT CCACCTCGGG
CGGACGGGGT TCTGCTGCTT CGGCGCCTGA
 
Protein sequence
MSLRAQPSLL SPESPLSAAC PDHHALRSGR SLLARVSHGV RLAQGGMFEQ TAALLDEAHR 
RLLGHPVAES ATVMPGVLLN LGLAQTLCGR FPQAEEHLRE ARSLARERRL PLMGLVIDQN
LGCLSLYRGD APAAIATFHG LTDRLPSERR EALHVDLAEA LLAEGLVEEA ATALADGPWA
GGSSANALLV EAKLRLLRGD HRHTVELTRR VRHSFGTGSL WYRLATRLER MALREGRTTS
PVARARTALA VRAPLAAPRP SGKGAAHVAL DTLDAHARLA PGPWLAGAVS DPHVVRAGLE
SALRAGEAAT ALEWAELART WAAPHVPGPG VRTPATASLA DRYRAALVRG RDPHVLARRL
ESARWQAHHQ TPGARRAPAP APVAGALLER LGDRAFVRYT RAEGKAVALV AAAGRVHARV
LGPLPRVARA MARFVHPVAF AQDGCEAREA AEDVARVLLE PLLALVGDLP LVVAGDSYLG
DPPWGMLPAL RGRPLNLVPG AGFWLDRTRS GVPVPRPPER VLLVAAPEPD GAAREVAALA
DVYPGARVLR ADRALRSDVL ASLGRADLVH LAGHGRVPGR SPMLASVGLG DGPLLACDLA
GLPEAPDTVV LSTCWSGRGF ADRAGAPLGF TGALLAAGVR TVVASPVPVR DAGTAGAMRL
FHRALAAGVP APEAVAVHLG RTGFCCFGA