Gene Information |
Locus tag | Ndas_2140 |
Symbol | |
ID | 9245990 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2559397 |
End bp | 2562858 |
Gene Length | 3462 bp |
Protein Length | 1153 aa |
Translation table | 11 |
GC content | 80% |
IMG OID | |
Product | transcriptional regulator, SARP family |
Protein accession | YP_003680068 |
Protein GI | 297561094 |
COG category | |
COG ID | |
TIGRFAM ID | |
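The length fields in this record follow from the coordinates by simple arithmetic, which the sketch below reproduces. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 2559397
END_BP = 2562858


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 3462 1153
```

Both values agree with the Gene Length and Protein Length fields listed above.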
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.032223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGGGAG AGGCGACGCA GGCGGCGGTG TCCTTCCGCG TGCTGGGCCC TCTGGAGGCC GTCGGCGCGC ACGGGCCGCT CGCCCTCAAG GGGCCGCGGC ACCGCGCGGT GCTGGCCCGG CTCCTGGTCG CCGAGGGCCG TGCCGTGCCC GTGGACCGCC TCGTGGACGA CCTGTGGGAG GCGCCCGCCG AGGGCTCCGT GGCGGCGGTG CGCACCTTCG TCTCCGCGCT GCGCCGGTCC CTGGAACCGG ACCGGCCCGC GCGGCGACCC GCCCGGCTGC TGGTGACCGC GCCACCGGGT TACGCGCTGC GGGCCGGACC CGACGCGGTC GACGCCCGGC GCTTCGCCGC GGCCGTGGCC CGCGGCGGAG CGCTGCTCAC CGAGGACCGG CCCGGAGCGG CGCTGGACGG GCTGGAGGAG GCCCTCGGGT GGTGGCGGGG GCCCGCCTAC GCCGAGTTCG CCGCCTACCC CTGGGCGCGG GCCGAGGCCG ACCGGCTGGA GGGGCTGCGG CTGCTGGCGG TGGAGCGGCA CGCCGAGGCC CTCCTGGCGC TGGGCCGTGC GGGCGACGCC GTGCCCGCCC TGGAGGCGCA CGCCCTGGCC CACCCCCTGC GCGAGAACGC CTGGCGGCAG TGGGCGCTGG CCCTGTACCG CTCGGGCCGC CAGGGTGACG CGCTGGCCGC CCTGCGCCGC GCCCGCCGGA CCCTGGCGGA CGAACTGGGG GTGGACCCCG GCCCCGAACT GCGGCGGTTG GAGTCCGACG TCCTGGCCCA GGCTCCCCAC CTCATCCCGC GTGCCGCCAC CGCGGTTCCG GCACGGGCGG AGAGCGCGCC CGAACGCGCC CAGCGTCCCT TCGTGGGCCG GGCCCGGGAG CTGGAACTGC TGGAGGGGGC CGCCGCCTCC GCCGCGCCGC GCGCTCCGGC CCGGGTCGCG CTGGTCTCCG GCGACGCCGG GGCGGGTAAG ACGGCCCTGG CCGAGGAACT CGCGCGGCGG CTGGCCGGGC GCGGGTGGAC CGCGGCCTGG GGGCGCGGCC CCGAGCACGA GGGGGCGCCC GCGGTCTGGC CCTGGACGCA GATCGCCGCC GTGCTGACGG CGGCCGCGGA CACCGCGACC CACGGGGTCG CCACCGGCCC GGCACAGGCC GCGGAACACG CCGCCGACCC CGGCGCCCCG CGCGCTGCCA CCACCGACCC CACCGGCACC GGATCCGGGG CCCCGCGCAT CGATACCGCC ACGGCCACGG GCATCACAGC GCCCAGCGGC ACCGCTGCTC CCGCCCGGGA CGACGACACC ACAGCGCCCG GGGTCGCCAG CGCGGACCCC GCCGCGGCCC GGTTCGGCGT CCTGCGCCGG GCCGCCCGCC TCCTCGCCTC GGCCACGCGG CGCGGCCCCG TCCTGCTGGT CTTCGACGAC CTGCACCGGG CGGCGGAGGA GACCCTGGAA CTGTTCACCT TCCTGGCCGC CGAACCGCTC CCCGGGCCCG TGCTGCTCGT GGGGACCTAC CGGACCGGCG AGGTCCTGCC CGCGCTGACC GCGGCCCTGG CCCGGCTCGC CCCCGGCGAA CCGGCCCGCG CCTACCTGGG CGGGCTGGCC GAGGACGCGA CCGGCGCGCT CGTACGGGCC CTGGTCGGCC GCGACGTCGA CGGGCGGGCG CTGCGCACCG TCCACCGGCG CAGCGGCGGC AACCCCTTCT TCGTCCGTGA GCTGGCCCGG TTGCTGTCCG ACGGGGACGG CGCCGCGCTC GACGCGGTCC CCGCCGGGGT GCGCGACGTC ATCCGCCACC GCCTGGGCTC CCTCACGCCC 
GGGGCCCGGG CCCTGCTGCG GCAGGCCTCG GTGATCGGCC GCGACATCGA CCCCGACGTC CTGTCGGCCC TGTCCCCGGA CGGGGACGCC CTGCTCGACG CCCTGGACGA GGCGCTGGAG GCGGGCTTCC TCACCGACCG CGCCGAACCG GACGAGCCCG ACGGCGCCCC CGGCCTGCGC TTCGCGCACG TGCTGGTGCG CGACACCCTC TACCAGGACC TCTCCCGACC GCGCCGGGCG CACTGGCACA CGGCGGTGGC CGAGGCCGTC GAGGCGCTGC ACCCCGACCG GGCCGACCTG CTCGCGCACC ACTTCGGGCG GGCGGGAACC CGCGCCACCG CGACCCGGGC CGCCCACCAC GCCCGCACAG CCGCCCTGCG GGCCGAGGAG CGCTTCGCCC CGCACGAGGC GGCCCGGCTG TGGCGGGAGG CCGTCGCCGC CCACGACCGC TCCGGCGAGG ACCGGCCCCG CGAACGCCTG GAGGCGGTCA TGGGCATGGT GCGGGCGCTG GCCGTCACCG GCCGCCTGGA GGAGGCCCGC CACCACCGCG CCCGGGCGGT CGCCGCGGCC GAGGAGCTGG GTGACGCGGA ACTGACGGCG CAGGTCATCA CCGCCTTCGA CGTGCCCGCC CTGTGGCCGC GCAACGACGA CGAGGAGCTG GCCCGCCAGA TCGCCGGGGC CGCCGAGCGC ACCCTGGCCG CGCTGCCCGA GGACCGCCCC GAGCAGCGCG TCCGCCTGCT GTGCGCCCTG GCCCTGGAAC TGCGCGGCAC CGCCACCGGC CGGGGCCCGG ACGCGGCCCG CCGGGCCGAG GAGGCCGCCC GCGGGCTCGG CGACCCCGCC CTGCTGTGCC TGGCGCTCAA CGCCCGCTTC ATGCAGTCCT TCCAGGGCTC CGGGCGGGCC CCGCAGCGCG TGGAGATCGG CGCGGAGCTG GTCGACCTGG CCTCTCGGCA CGGCCTGGTG ACCTTCGAGG TGCTGGGCCA CCTCGTCCTG GTCCAGGCGC ACTCCGCGCT CGCCGACTTC GGTGCCGCCG ACGCCCACGC GGCCGACGCC GACCGGCTGG GCTCGCGCTA CGGGATTCCG CTGGTGGGGG TGTTCACCCG CTGGTACGAG GCACTGCGCA CGGCCGCGCG GGGGGCCGTC GAGGAGGCCG AGGCCGCCTA CCGTGCCGCG AGCGTGCGGC TCGCCGACAC CGGCATGCCC GGTGTGGAAC AGGGCATCCT GCCCCTGGCG CTGCTGTGTC TGCGCCTTCA GGGCGGCCGA CCCGCACCGG TGGACCCGCG CCAGGACTGG GGCCCCTACG CGCCCTGGGC CGACGCCCTG GCCTCGCCCG AGTCCGCGCC CCCGCCGCCC GACGCACCGC CCGGCCTGCT CGGGGAGGCT CTGACCTGTC TGGCCGCCCG GGCCGCCACC GCCGTCGGCG ACCACCCCGC GATGGAGCGC GCCCACCGCC TGCTCTCACC CGCCGCCGGG GAGCTGGCCG GTGCGGGCAG CGGTCTGTTG ACCCTGGGAC CGGTCGCGCA CCAGCTCGGC GACCTCGACC GCGCGCTCGG ACGCCGTGAG CGGGCCGCGG AGCACTACCG GCTGGCCCTG CGCGTGGCCA TCCGGGCCGG GTCGCCGCAC TGGACAGCCG CGGCCCGGGC GGCCCTGGCC GACCTGGGCT GA
|
Protein sequence | MGGEATQAAV SFRVLGPLEA VGAHGPLALK GPRHRAVLAR LLVAEGRAVP VDRLVDDLWE APAEGSVAAV RTFVSALRRS LEPDRPARRP ARLLVTAPPG YALRAGPDAV DARRFAAAVA RGGALLTEDR PGAALDGLEE ALGWWRGPAY AEFAAYPWAR AEADRLEGLR LLAVERHAEA LLALGRAGDA VPALEAHALA HPLRENAWRQ WALALYRSGR QGDALAALRR ARRTLADELG VDPGPELRRL ESDVLAQAPH LIPRAATAVP ARAESAPERA QRPFVGRARE LELLEGAAAS AAPRAPARVA LVSGDAGAGK TALAEELARR LAGRGWTAAW GRGPEHEGAP AVWPWTQIAA VLTAAADTAT HGVATGPAQA AEHAADPGAP RAATTDPTGT GSGAPRIDTA TATGITAPSG TAAPARDDDT TAPGVASADP AAARFGVLRR AARLLASATR RGPVLLVFDD LHRAAEETLE LFTFLAAEPL PGPVLLVGTY RTGEVLPALT AALARLAPGE PARAYLGGLA EDATGALVRA LVGRDVDGRA LRTVHRRSGG NPFFVRELAR LLSDGDGAAL DAVPAGVRDV IRHRLGSLTP GARALLRQAS VIGRDIDPDV LSALSPDGDA LLDALDEALE AGFLTDRAEP DEPDGAPGLR FAHVLVRDTL YQDLSRPRRA HWHTAVAEAV EALHPDRADL LAHHFGRAGT RATATRAAHH ARTAALRAEE RFAPHEAARL WREAVAAHDR SGEDRPRERL EAVMGMVRAL AVTGRLEEAR HHRARAVAAA EELGDAELTA QVITAFDVPA LWPRNDDEEL ARQIAGAAER TLAALPEDRP EQRVRLLCAL ALELRGTATG RGPDAARRAE EAARGLGDPA LLCLALNARF MQSFQGSGRA PQRVEIGAEL VDLASRHGLV TFEVLGHLVL VQAHSALADF GAADAHAADA DRLGSRYGIP LVGVFTRWYE ALRTAARGAV EEAEAAYRAA SVRLADTGMP GVEQGILPLA LLCLRLQGGR PAPVDPRQDW GPYAPWADAL ASPESAPPPP DAPPGLLGEA LTCLAARAAT AVGDHPAMER AHRLLSPAAG ELAGAGSGLL TLGPVAHQLG DLDRALGRRE RAAEHYRLAL RVAIRAGSPH WTAAARAALA DLG
|
| |
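The sequences above are displayed in space-separated blocks of 10 characters, so they need to be normalized before any downstream use. A minimal sketch, assuming the field value has been copied into a string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the fragment used here is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace (spaces, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence shown above.
fragment = "ATGGGGGGAG AGGCGACGCA GGCGGCGGTG"
seq = clean_sequence(fragment)
print(len(seq), seq[:3])  # prints: 30 ATG
```

Applied to the complete gene sequence, the cleaned string would be expected to have the length given in the Gene Length field, and `gc_fraction` could be compared against the GC content field.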