Gene Information |
Locus tag | Ndas_3264 |
Symbol | |
ID | 9247121 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3897231 |
End bp | 3898856 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | Methylcrotonoyl-CoA carboxylase |
Protein accession | YP_003681176 |
Protein GI | 297562202 |
COG category | |
COG ID | |
TIGRFAM ID | |
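The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3897231, 3898856)
print(length, protein_length_aa(length))  # prints: 1626 541
```

Both values agree with the Gene Length (1626 bp) and Protein Length (541 aa) fields of this record.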
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.399585 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGGG CACCTGTGCT CGGCACGGCG GTGGACACCG CCGGGCCCGC GTTCGCCGCC AACGACGCGG CGAACCGCGC CCTCGCCCTG GAGCTGCGCG AGCGGATCGC CACCGCGGCC CTGGGCGGCC CGGAGAGGAC CCGGACACGG CACGTCGAGC GGGGCAAACT CCTGCCGCGC GACCGCGTGG ACCTCCTCCT GGACCCCGGC TCCCCCTTCC TGGAGATCGC CCCCCTGGCC GCGTACGGGC TCTACGGCGC GGACGGACAG GACGCCCCCG GCGCCGGGAT GATCGCGGGC GTGGGCCGGG TCATGGGCCG CCCGACGGTC GTGGTCGCCA ACGACGCCAC GGTCAAGGGC GGCAGCTACT ACCCCATGAC CGTCAAGAAG CACCTGCGCG CCCAGGAGGT GGCGCTGCAC AACCGGCTTC CGTGCGTCTA CCTCGCCGAC TCCGGGGGCG CGTTCCTGCC CATGCAGGAC GACGTCTTCC CCGACCGCGA GCACTTCGGC CGGATCTTCT ACAACCAGGC CACGATGTCG CGGATGGGCA TCCCGCAGAT CGCCGCGGTG CTGGGCTCGT GCACGGCGGG CGGCGCCTAC GTCCCGGCGA TGAGCGACGA GGCGGTCATC GTGCGCGAGC AGGGCACGAT CTTCCTGGGC GGCCCGCCGC TGGTCAAGGC GGCCACCGGC GAGGTCGTCA CCGCCGAGGA GCTGGGCGGC GGCGACCTGC ACTCGCGGGT GTCGGGGGTC ACCGACCACC TGGCCGACGA CGATGCGCAC GCCCTCACCC TCGTCCGGCG CATCGCCGAC ACCCTCGGCC CGCCCGGACC GCCCGCGTGG GAGGTGCGCG AGCCCCGTCC CCCGGCGCTG GACCCCGGGG AGCTGTACGG GGTGGTGCCC GCCGACACGC GCACCCCCTA CGACGTGCGC GAGGTCATCG GCCGGATCGT GGACGGCAGC GAGTTCACCG AGTTCAAGGC CGAGTACGGC ACGACCCTGG TCACCGGGTT CGCGCACCTG CACGGCCACC CGGTCGGGAT CGTCGCCAAC AACGGCATCC TGTTCGCCGA GTCCGCGCTC AAGGGCGCCC ACTTCATCGA GTTGTGCGAC CGGCGCGGCG TGCCGCTGGT CTTCCTCCAG AACATCTCCG GGTTCATGGT CGGACGGGAC TACGAGGCGG GCGGTATCGC CAAACACGGG GCGAAGATGG TCACCGCCGT GGCCTGCGCC CGCGTCCCCA AGTTCACCGT GGTCGTGGGC GGGTCCTTCG GCGCGGGCAA CTACAGCATG TGCGGGCGGG CCTACTCGCC CCGGTTCCTG TGGATGTGGC CCAACGCGCG CATCTCGGTG ATGGGCGGGG AACAGGCCGC CTCGGTGCTC TCCACCGTCC GCCGCGACCA GATGGCCGCG CGCGGGCAGG AGTGGTCCGC CGAGGACGAG GAGGCCTTCA AGGCGCCCGT GCGCGACCAG TACGAGGAGC AGGGCAGCCC GTACTACTCC ACCGCGCGGT TGTGGGACGA CGGCGTCATC GACCCCGCCG ACACCCGCGA CGTGCTCGCC ATGGCCCTGT CCGCCGCCCG CCACGCCCCG CTGGAGCCGG TGGGCTACGG CGTCTTCCGG ATGTGA
|
Protein sequence | MSRAPVLGTA VDTAGPAFAA NDAANRALAL ELRERIATAA LGGPERTRTR HVERGKLLPR DRVDLLLDPG SPFLEIAPLA AYGLYGADGQ DAPGAGMIAG VGRVMGRPTV VVANDATVKG GSYYPMTVKK HLRAQEVALH NRLPCVYLAD SGGAFLPMQD DVFPDREHFG RIFYNQATMS RMGIPQIAAV LGSCTAGGAY VPAMSDEAVI VREQGTIFLG GPPLVKAATG EVVTAEELGG GDLHSRVSGV TDHLADDDAH ALTLVRRIAD TLGPPGPPAW EVREPRPPAL DPGELYGVVP ADTRTPYDVR EVIGRIVDGS EFTEFKAEYG TTLVTGFAHL HGHPVGIVAN NGILFAESAL KGAHFIELCD RRGVPLVFLQ NISGFMVGRD YEAGGIAKHG AKMVTAVACA RVPKFTVVVG GSFGAGNYSM CGRAYSPRFL WMWPNARISV MGGEQAASVL STVRRDQMAA RGQEWSAEDE EAFKAPVRDQ YEEQGSPYYS TARLWDDGVI DPADTRDVLA MALSAARHAP LEPVGYGVFR M
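The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute GC content as a percentage, and translate a CDS with the amino acid assignments of NCBI translation table 11 (which are the same as in the standard table). All function names are hypothetical. The sketch assumes an uppercase or lowercase A/C/G/T sequence and does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_content_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First six codons of the gene sequence above, plus its TGA stop codon.
print(translate_cds("ATGAGCAGGG CACCTGTGTG A"))  # prints: MSRAPV
```

Passing the full Gene sequence field to `gc_content_percent` and `translate_cds` gives values that can be compared against the GC content and Protein sequence fields of the record.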