Gene Information |
Locus tag | Ndas_4745 |
Symbol | |
ID | 9248627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5629263 |
End bp | 5631017 |
Gene Length | 1755 bp |
Protein Length | 584 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_003682637 |
Protein GI | 297563663 |
COG category | |
COG ID | |
TIGRFAM ID | |
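
The length fields above follow from the coordinates. The sketch below reproduces that arithmetic, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 1755 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information rows above.
START_BP = 5629263
END_BP = 5631017


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1755
print(protein_length_aa(length))  # 584
```

Both results match the Gene Length and Protein Length rows.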
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCTG ATGAGAAGCC CTCTGCCTCC GGCAGGGACC ACACACCAGA CGACAGGCCC GAGGACGGCT CCACACCCCG ATCAGGGACC GAGAACACCG GCGACGACGC GGCGGGTGGG GACAACACCA CCGCGTCCGA GACCGGGGCT GACGGAACCG GCCGGGCCCA CGAGTCGGAC GAGACCGAGG GGGCGGAGGA GGTCGGAAAG GCAGAAGAGG CCCCCAACAC CGGTGTGACA TCGGACACCG GAGGCGCCCA GGAGACGCAG CAGGCCCAGG AGACCCCGGA GGCTGACGAG CCTGCCGGTG CCGGGGAGGC TGCCGAGGCT GGCGACACCC GAAAGTCCAC CGGTACCGAA GGGGCCACCG AGACCGAGAA GGTCACCGGA GACGAGAAGG CCGTTGAGGC CGAGGAGGCC ACTGCGGAAT CAGCGGCTGG ATCCGCCGCC GGGGACGAAG AGCGCACTGG GACCAAGGCG CCCTCAGAGG CCGGTGGGGC TGGTGCCCCA GCCGGAGCCG CCGGGACCGG GGAGTCCGGA GAGACCGCGG AGACCGCTGG GGCCGCCCCG TCCGCCGCTC CCGTCCGCAC GAAGCGCCGC CGGACCGGCA GGATCCTGGT CTGGGTCGCC GCGAGCCTGG TCCTCGTCCT GGCCGCCGGG GTCGGCACCG CCTACGGCTA CTACCGCTCG CTGCGCTCGG ACATGGTCCA GTACGACATC GACGGGCTGC TCAAGGAGGA GGACCGGCCC GAGAGGATCA ACGACTCCGT CAACATCCTC TTCATGGGCA CCGACGGCTA CGAGGAGGGC AGCACCGCCT ACTCGACGGA GTTCGAGGGC GAGCGTTCGG ACTCGATCAT GCTGGCGCAC ATCTCACCCG AGAGCCGGGT GTCGGTGATC AGCTTCCCCC GCGACTCGCT GGTGGCCCTG CCCGACTGCG ACCCCTACGG AGAAACCGAG GGCACGCCCG GCTACTTCGG CATGATCAAC GCCGCGATGT ACCACGGCGG ACCGCCCTGC GTGGTCAGCA CCATCGAGTC GCTGAGCGAC GTCCGCATCG ACCACTTCGT GCACCTCAGC TTCATGAGCT TCCGGGACGT GGTGGACGCC ATCGGCGGCG TGGACATGTG CATTCCCGAG CCGATGGAGG ACAGCCGGTC CAAGCTCGAC CTCGACGCGG GCCAGCAGAC CCTCGACGGC GACGAGGCGC TGTCGTTCGT CCGGGCCCGC TACGAGATCG GCGACGGCGG CGACATCGGC CGCATCGACC GCCAGCAGAT GTTCCTCGCG GCCCTGGCCG ACCAGGTGAC CAGAAACGAC GTGATCACCG ACCCGGGCAG GCTCAACGCC GTTCTGCGCG CGGTGGCCGA GCACAGCGCC ACGGACAGCG CCCTCACGTT CGACCGGATG CTGTCGATCG CCGTGACCCT GGCGGACGTG GAGCTGACCG ACATCGAGTT CCACACCGTG CCCTGGTACC AGGCGCCCTC CAACCCCAAC CGGGTCCTGT GGTACGAGGA CCAGGCCGAG GAGCTGTTCA CCGCCGTGCG CGAGGACCGG CCCCTGCCCC TCACGATGGC CGACGAGGCG CCCGTTCCCC AGGACCCGCC CGGGGCCTCG CCCTCCCCGG CGGACGAGGA GGTCGCGGAG GCCTCCCCGG ATGACGAGCC CGCCCGTCCG GGCGTGGGAC GCGACGCCAC CTCCAACCCG TGCTCCGACG GCCTGGGCTA CGGCACCGGG GACGAGATGG AATAA
|
Protein sequence | MPADEKPSAS GRDHTPDDRP EDGSTPRSGT ENTGDDAAGG DNTTASETGA DGTGRAHESD ETEGAEEVGK AEEAPNTGVT SDTGGAQETQ QAQETPEADE PAGAGEAAEA GDTRKSTGTE GATETEKVTG DEKAVEAEEA TAESAAGSAA GDEERTGTKA PSEAGGAGAP AGAAGTGESG ETAETAGAAP SAAPVRTKRR RTGRILVWVA ASLVLVLAAG VGTAYGYYRS LRSDMVQYDI DGLLKEEDRP ERINDSVNIL FMGTDGYEEG STAYSTEFEG ERSDSIMLAH ISPESRVSVI SFPRDSLVAL PDCDPYGETE GTPGYFGMIN AAMYHGGPPC VVSTIESLSD VRIDHFVHLS FMSFRDVVDA IGGVDMCIPE PMEDSRSKLD LDAGQQTLDG DEALSFVRAR YEIGDGGDIG RIDRQQMFLA ALADQVTRND VITDPGRLNA VLRAVAEHSA TDSALTFDRM LSIAVTLADV ELTDIEFHTV PWYQAPSNPN RVLWYEDQAE ELFTAVREDR PLPLTMADEA PVPQDPPGAS PSPADEEVAE ASPDDEPARP GVGRDATSNP CSDGLGYGTG DEME
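
The sequences above are printed in space-separated blocks of ten. As a minimal sketch of how the two rows relate, the code below strips the spacing, computes a GC fraction, and translates in frame up to the first stop codon. It uses only the standard library. The codon assignments are those of the standard genetic code, which translation table 11 shares. Table 11 differs in its permitted start codons, and this sketch does not model that difference (the gene here starts with ATG). All function names are illustrative.

```python
# Standard codon assignments in NCBI order (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGCCCGCTG ATGAGAAGCC CTCTGCCTCC"))  # MPADEKPSAS
print(translate("GACGAGATGG AATAA"))                  # DEME
```

The two excerpts translate to the first and last blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` gives a way to check the GC content and Protein Length rows.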