Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2412 |
Symbol | |
ID | 9246262 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2862859 |
End bp | 2863971 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003680339 |
Protein GI | 297561365 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0744334 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGGTT CTGTCGAGGT TCGCTCCGAA GGGCGTCCGG TCAGCACGGG AGGGCCCAAA AGGCGTACGG TCCTGGCCGC ACTCCTCCTG CAACCCGGCA CCGTGGTCTC CGACAACCGG CTCATCGACC TGGTCTGGGG CGAACACCCT CCGCGCAGCG CACGGTCCCA GCTCCAGGCG CACGTGCACG AACTCCGCAA GGTCCTGGGC GCGGACACCA TCGTCCGGAG CACCTGCGGC TACCGGATGA CCGTGGCGGC CGAGGCCACG GACAGCGGCG TGTTCGAGCG GCTGCTGGTG CGCGCGCACT CCGAGCGCGC CGCCCGCCGC CCCTCCGATG CGGTCCGGAC CCTGCGCACC GCGCTGTCGC TGTGGACGGG CCCGGCGCTG GGCGGCGTGA CACCGGCGCT CGCCGCCCAC GCACGCCCGG CCCTGGAGGA GAGGCGGCTG CACGCCCTGG TGGAGCTGCA CGGCGCCGGG ATCGACCTCG GCCGCGGAGC CGCCGCCGTC CCCGAACTGC TCTCGCTGTG CGCGGAGCAC CCGACCCACG AACGCTTCGC GGGCCTGCTG ATGTCGGCCC TGCACGCCTG CGGACGCACC GGGGAGGCGC TGGAGGTCTA CGCGAAGCTG CGCGAGAGGC TGGCCGACGA ACTGGGCACC GGCCCCGGCG CCCGGCTCCG GCAGCTGCAC CTGGACCTGC TCACGGCCGG GCCGGAGGGC GGGAGCACGC GCGGCGGCGC GCCCGTCGCC CACCGGGTCC GCCCCGCCGA ACTGCCCTAC GGCGTCGGTG AGTTCGTCGG CCGCTCGGCG GAGCTGTCCG TGCTCGACCA GGCGCTGGAC GACGACCGGG GGGCCGACCG CTGGCCCGCG GTGTTCCTCC TCACCGGGGT CTCCGGCGTC GGCAAGACCG CGCTGGCCCT GCACTGGAGC CATGCCGTCC GGGAGCGCTT CCAGGATGGC CAGCTCTACG TGGACCTGCG CGGTTCCTCC TCCGGGGGGG GGAGCCCGTC CGGGCCGAGG ACGCGCTGCG CCAGCTGCTG CGCGGACTGG GCGCGGACCC CGGAGGCCTG CCGACCGGCG CCGACGAGCT CGCCAAGCTC TTCCGGTCGG TGA
|
Protein sequence | MLGSVEVRSE GRPVSTGGPK RRTVLAALLL QPGTVVSDNR LIDLVWGEHP PRSARSQLQA HVHELRKVLG ADTIVRSTCG YRMTVAAEAT DSGVFERLLV RAHSERAARR PSDAVRTLRT ALSLWTGPAL GGVTPALAAH ARPALEERRL HALVELHGAG IDLGRGAAAV PELLSLCAEH PTHERFAGLL MSALHACGRT GEALEVYAKL RERLADELGT GPGARLRQLH LDLLTAGPEG GSTRGGAPVA HRVRPAELPY GVGEFVGRSA ELSVLDQALD DDRGADRWPA VFLLTGVSGV GKTALALHWS HAVRERFQDG QLYVDLRGSS SGGGSPSGPR TRCASCCADW ARTPEACRPA PTSSPSSSGR
|
| |