Gene Information |
Locus tag | Ndas_1575 |
Symbol | |
ID | 9245425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1926748 |
End bp | 1928253 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | cytidyltransferase-related domain protein |
Protein accession | YP_003679510 |
Protein GI | 297560536 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
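The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a stop codon that is not counted in Protein Length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1926748
end_bp = 1928253
gene_length_bp = 1506
protein_length_aa = 501


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1506
print(expected_protein_length(gene_length_bp))  # 501
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.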
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.528601 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0217414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGGGG GAACCGTGGT GGTGGTCGGC GACGCCCTGC TCGACGTGGA CCTGCGCGGC GTGTCCCGGC GCGACTGCCC CGACGTGCCC GCGCCCGTGC TGGAGGAGCC CGAGCCCTGG TACCGGCCCG GAGGCGCGGC CCTGGCCGCC CGCCGCGCCC GGCAGGACGG CCGCGACGTG GTCCTGGTGA CCGCCGTGGG CCGCGACGCG GCCGCCGACG AACTGGCCGC GCTCGTGGGG GAGGGCGTGC GGCTGGTGGG GCTGCCCCTG ATGGAGCACA CGCCCACCAA GACCCGGGTC CAGGCCAACG GCCGCACCGT GGCCCGTCTG GACCAGGGGT GCGAGGGCGT GGAACTCGAC GCCCGCGCCG AGGACGTCGC CGAGGCCCTG GCCGGGGCGG CGGCCGTGCT CGTGGCCGAC TACGGGCACG GGCTCACCCG CCAGCGCGCC GTACGCCGGG CCCTGGCGGC CTGCGCGCGG CGCGGCGTCC CGCTCGTGTG GGACCCGCAC CCGCGCGGCG CCGACCCCGT GCCCGGAACC CGCCTGGCCA CGCCCAACGC CGCCGAGGCC GGGGTCGCCG GGGGGACCGG CGAGCGGGCC CTGCGCCGGG CCGGTGAGCT GGCCGGGGCG TGGGACGTGC ACTCGGTGGC CGTCACCCTC GGCGCGCGCG GCGCCGCCTG GTCGGACGCC GGGGGCGGGT GCGCCCTCCT GCCGGGCACA CCCGTCGACG CGCCGACCGA CACCTGCGGC GCCGGGGACG CCTTCGCCGC CGCCTGCGCC ACGGCGCTCG CCGACGGCGA CGACGTACGC GACGCCGTCC GACGCGGGGT CGCCTCCGCC TCGGCCTTCG TCGCCGGGGG CGGGGCCTCG GCCTACGCCG CCCCGAGCAC CGCCGCCGGG CGGGCGCGCG CCGCCGCGGT GCCCGGCCCC CGCGCGGGCG AGGGCGGCCG GGCGGAGAGG GTCGTGGCGA CCGGCGGCTG CTTCGACGTC CTGCACGCGG GCCACGTCGA CCTGCTGCGC CGCGCCCGCG CCCTGGGCGA CCGCCTCGTG GTCCTGCTCA ACAGCGACGC CTCCGTGCGC GCGCTCAAGG GCAGCGGCCG TCCGGTGGTC GCCGAACAGG ACCGCGCCCG CGTGCTGGGC GCGCTCGACT GCGTGGACGA GGTGGTCGTC TTCGACGAGG ACACGCCCGT GCGCGCCCTG GAGGAGCTGC GGCCCGACGT GTGGGTCAAG GGCGGCGACT ACGAGGTGGA GGACCTGCCC GAGACGCCCG TCGTGCGGAG GGCCGGGGGG GGAGGTGGTC ACCGTGCCCC TCGTGCCCGG CCACTCCACG ACCGGGCTGT TCACCCGCAT CCGCGGCCGG GGCCGCGACG GGGTACCCGC CCGCTGAGGG CCCCGGGACC GGACGCCCCG GCCCCGGAGC CCGGGAAGTC GGAAACGAAC GACGGACAGC GACGAACGAC GACGGAGAAC GAGGGAGACA GATGCGTCCA CTCGGAAACA CGCTGA
|
Protein sequence | MSGGTVVVVG DALLDVDLRG VSRRDCPDVP APVLEEPEPW YRPGGAALAA RRARQDGRDV VLVTAVGRDA AADELAALVG EGVRLVGLPL MEHTPTKTRV QANGRTVARL DQGCEGVELD ARAEDVAEAL AGAAAVLVAD YGHGLTRQRA VRRALAACAR RGVPLVWDPH PRGADPVPGT RLATPNAAEA GVAGGTGERA LRRAGELAGA WDVHSVAVTL GARGAAWSDA GGGCALLPGT PVDAPTDTCG AGDAFAAACA TALADGDDVR DAVRRGVASA SAFVAGGGAS AYAAPSTAAG RARAAAVPGP RAGEGGRAER VVATGGCFDV LHAGHVDLLR RARALGDRLV VLLNSDASVR ALKGSGRPVV AEQDRARVLG ALDCVDEVVV FDEDTPVRAL EELRPDVWVK GGDYEVEDLP ETPVVRRAGG GGGHRAPRAR PLHDRAVHPH PRPGPRRGTR PLRAPGPDAP APEPGKSETN DGQRRTTTEN EGDRCVHSET R
|
| |
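The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a field could be cleaned, translated, and summarized follows. It assumes the gene sequence is shown 5' to 3' in the coding orientation, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names (clean_sequence, gc_fraction, translate) are hypothetical and are not part of any database API. How the GC content field of the record was computed is not stated, so it is not reproduced here.

```python
from itertools import product

# Amino-acid assignments of NCBI translation table 11 (identical to the
# standard code), listed for codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above (an excerpt, not the full gene).
excerpt = clean_sequence("ATGAGTGGGG GAACCGTGGT GGTGGTCGGC")
print(translate(excerpt))  # MSGGTVVVVG
```

The translated excerpt matches the first block of the protein sequence in the record. Applying the same helpers to the full gene sequence field would give the complete translation and a GC fraction for comparison with the table.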