Gene Ndas_2939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2939 
Symbol 
ID9246791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3511429 
End bp3512511 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content77% 
IMG OID 
Producttranscriptional regulator, TetR family 
Protein accessionYP_003680855 
Protein GI297561881 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0355871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCGGC TCACCAGGGC GCAGCGCCAG GCCCGCAACC GGGCGCGCGT GCTGTCCGCG 
GCCGGGGACG AGTTCGCCGA GCACGGTTTC CGCGACGCCA AGGTCGACCG TATCGCCGAA
CGGGTCGACC TCACCCGGGG CGCCGTCTAC TCCAACTTCC CCGGCAAGCG CGCCCTGTAC
CTGTCGGTGC TGGCCGACGC CGCCGAGCGT TCCGCCGACG CCGCCCGGCC CGAACCCGGG
CACAGCGCCC GCACGGCGCT GGGCGCGCTC GCCCGCGCCT GGGCCTCCCG GCTCCCCACC
ATCGGCGAGT CCTCGCTGAC CGGGACCGCG CTTCCGCCCG AGGTCCTCTC CGACGAGCCC
GTCCGGAGCG CCTTCGCCCA GCTCATGCGG TTGAACGCGC TCCTGCTCGG CCTGTCCTTG
GAGGCGTTGG CGCCCCCGTC CGTCCCCGGC GGACGCCGGG TGCGCGTGGC CGGGACCGTG
CTCACTACCC TGTACGGGAC CGGCCAGCTG GTCGGTGTCG CGCCCGGCTT CGCGGACCCC
TTCGCGGTGG TGCGCGCCTG CGAGCGCCTG GCCGACCTGG ACCTGGAGGA CTCCTGGCCG
CCGCCCCACC TGGAGCACGT GCGCCAGGCA GTGCCCGCCG ACGAGGAGTG GTCACCGCCG
GAGGCCTTCG ACGCGGTCCG CAGGCGCGCC GTGTCCCTGG CCGGGGACGG GATCGTGGCG
ATCCTGGGCA CGCACCGCCT CGAAGCGGCG GAGGAGGCGC TGCGCTCCGC GCCCGCGGGC
TCCCCCGTGA CCGCCGTGGT GGTCACGGGG GACCCCGACG AGCTGACCCC GCTCGCGCGG
CTGGCCGTGG CCGACCTGTG CGGCTGCCTG CGCCAGGCCT TCCCCGAGAG GGCCTGGCCG
CGCCTGCGCG TGGTGTTCGA CCCCTCCGGT GAGATCGCCG CGGCCGCGGG CGTGCCCGTG
GTCAGCGACG CCACCGAGAG CGCCGTCCGC GTCGTCGGCG GTCGGATCAC GGCCCGCTCC
GACGCCCGCG GCGCGGGCCA CGCCGTCGCC GCGCTCCTCG GCGCGCGGGC GGACCGCCGG
TAG
 
Protein sequence
MVRLTRAQRQ ARNRARVLSA AGDEFAEHGF RDAKVDRIAE RVDLTRGAVY SNFPGKRALY 
LSVLADAAER SADAARPEPG HSARTALGAL ARAWASRLPT IGESSLTGTA LPPEVLSDEP
VRSAFAQLMR LNALLLGLSL EALAPPSVPG GRRVRVAGTV LTTLYGTGQL VGVAPGFADP
FAVVRACERL ADLDLEDSWP PPHLEHVRQA VPADEEWSPP EAFDAVRRRA VSLAGDGIVA
ILGTHRLEAA EEALRSAPAG SPVTAVVVTG DPDELTPLAR LAVADLCGCL RQAFPERAWP
RLRVVFDPSG EIAAAAGVPV VSDATESAVR VVGGRITARS DARGAGHAVA ALLGARADRR