Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1938 |
Symbol | |
ID | 9245788 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2361220 |
End bp | 2363436 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | carbon starvation protein CstA |
Protein accession | YP_003679871 |
Protein GI | 297560897 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.121526 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCTG TCCCGACCCG CGGCCCAGCG GAGGCCGGGT CCCCCCGGCC GCGGTGGACG CCCCGCAGCA TCGCCGTCTG GACCGCCGTC GCCCTCGTCG GCGCCGTCGC CTGGGGCATG ATCGCCATCG GCCGCGGAGA GGAGGTCTCC GCCGCCTGGC TGGTCTTCGC CGCCCTGGCC TCCTACGCCA TCGGCTACCG CTTCTACGCC CGGTTCATCC AGTACAAGGT GCTGATCACC GACGACACGC GGGCCACCCC GGCCGAGCGG CTCGACAACG GCGCCGACTA CCACCCCACC GACCGCCGGG TGCTGTTCGG CCACCACTTC GCCGCCATCG CGGGCGCCGG TCCGCTCGTG GGCCCCGTGC TGGCCGCACA GATGGGCTTC CTGCCGGGCA CCATCTGGAT CATCGTCGGC GTGATCTTCG CCGGCGCCGT CCAGGACATG GTGGTGCTGT TCTTCTCCAC CCGCCGCAAC GGCCGCTCCC TGGGCCAGAT GGCGCGCGAG GAGATCGGCC CGGTGGGCGG CGTCGCCGCC CTCGTCGCCG TCATGGCGAT CATGATCATC CTGCTGGCCG TCCTGGCCAT GGTCGTGGTC AACGCGCTGG CCGAGTCCCC CTGGGGCTCG TTCTCGCTCA TCATGACGGT CCCGATCGCC CTGTTCATGG GCGTCTACCT CAGGTTCCTC AGACCCGGCA AGGTCATCGA GACCTCCGCC ATCGGCGTCG TCCTGCTGCT GCTGTCCATC GTGGGCGGCG GCTGGGTGGC CGAGCACGCC GTGCTGAGCG AGCTCTTCCT CTTCAGCAAG AACGAGCTCA CCTTCGGCCT GATCGTCTAC GGCTTCGTCG CCTCGGTCCT GCCGGTCTGG CTGCTGCTGG CCCCCCGCGA CTACCTGTCC ACCTTCATGA AGGTGGGCGT GGTCGTGCTC CTGGCCGCCT CCATCCTCAT CGTCGCCCCC GACCTGCGGC TGCCGCGCTT CACCGACTTC GCCTTCAACG GCGAGGGGCC GGTCTTCGCC GGGTCGCTGT TCCCGTTCGT GTTCATCACG ATCGCCTGCG GCGCCCTGTC GGGCTTCCAC GCGCTGATCT CCTCGGGCAC CACGCCCAAG CTCATCGAGA AGGAGAGCCA GGTCCGGATG ATCGGCTACG GCTCCATGCT CACGGAGTCG TTCGTGGCGA TCATGGCGCT GATCACCGCC TGCATCATCG ACCCCGGCGT GTACTTCGCC ATGAACATGC CCGCCACCGC GCTGGGCGAC ACCGTCCAGA GCGCCGCCGC GGCGGTCAGC CAGCTCGGCT TCACCGTCGA CGCCCAGACC CTGGAGCGCA CCGCCGAGCT GATCGGCGAG GAGTCGATCC TGGCCCGCAC CGGCGGGGCC CCCACGTTCG CCCTGGGCGC CGCGCAGATC TTCTCCGCGG TCTTCGGCGG CCAGGCCATG ATGGCGTTCT GGTACCACTT CGCCGTCATG TTCGAGGCGC TGTTCATCCT CACCACCGTG GACGCGGGCA CCCGGGTGGG CCGCTTCATG CTCCAGGACA CCCTGGGCAA CGTCTACAAG CCCATGCGCG ACCCCAGCTG GCGCCCGGGC GCCTGGCTGT GCAGCGCCCT GATCGTGGGG GCGTGGGGCT ACTTCCTGTG GGCCGGGGTC AACGACCCCC TGGGCGGCAT CAACCAGCTC TTCCCGCTGT TCGGCATCGC CAACCAGCTC CTGGCGGCGA TCGCCCTGGC GGTGTGCACC ACGCTGATCA TCAAGTCCGG ACGCGCCAAG TACGCCTGGG TGACCCTGGT GCCGCTGGCC TGGGACGCGG CGGTCACGCT GACCGCCAGC TACCAGAAGA TCTTCTCCTC CGACCCCGCC ATCGGGTTCT TCGCCCAGCG CCAGCAGTTC TCCGAGGCCC TGGCCGCGGG GGAGACGAGC GTGGGCGCCA TCCAGGGCGT GGAGGACATG CAGAGGGTCG TGCTCAACAC CACCGTCAAC GGCGTCCTGT CGGCGCTGTT CGCGATCCTG GTCCTCATCA TCCTGGTCGA CTCGGTCCGG GTGTGGATCA GGGCGCTGCG GGCGCCGGAG GGCACGCTGC CCACCAGCGA GGCGCCCCAC GAGACGTCCC GGCTGTGGGC CCCCTCGGGG CTGGTGCCCA CCCGCGAGGA GCGTGAGCAC GCCCGCCGAC TGGCCCCGGC GGGGGAGGGC GACGTCACCG CGCGCAGGGA GGAGTAG
|
Protein sequence | MSSVPTRGPA EAGSPRPRWT PRSIAVWTAV ALVGAVAWGM IAIGRGEEVS AAWLVFAALA SYAIGYRFYA RFIQYKVLIT DDTRATPAER LDNGADYHPT DRRVLFGHHF AAIAGAGPLV GPVLAAQMGF LPGTIWIIVG VIFAGAVQDM VVLFFSTRRN GRSLGQMARE EIGPVGGVAA LVAVMAIMII LLAVLAMVVV NALAESPWGS FSLIMTVPIA LFMGVYLRFL RPGKVIETSA IGVVLLLLSI VGGGWVAEHA VLSELFLFSK NELTFGLIVY GFVASVLPVW LLLAPRDYLS TFMKVGVVVL LAASILIVAP DLRLPRFTDF AFNGEGPVFA GSLFPFVFIT IACGALSGFH ALISSGTTPK LIEKESQVRM IGYGSMLTES FVAIMALITA CIIDPGVYFA MNMPATALGD TVQSAAAAVS QLGFTVDAQT LERTAELIGE ESILARTGGA PTFALGAAQI FSAVFGGQAM MAFWYHFAVM FEALFILTTV DAGTRVGRFM LQDTLGNVYK PMRDPSWRPG AWLCSALIVG AWGYFLWAGV NDPLGGINQL FPLFGIANQL LAAIALAVCT TLIIKSGRAK YAWVTLVPLA WDAAVTLTAS YQKIFSSDPA IGFFAQRQQF SEALAAGETS VGAIQGVEDM QRVVLNTTVN GVLSALFAIL VLIILVDSVR VWIRALRAPE GTLPTSEAPH ETSRLWAPSG LVPTREEREH ARRLAPAGEG DVTARREE
|
| |