Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2344 |
Symbol | |
ID | 9246194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2794007 |
End bp | 2795056 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003680272 |
Protein GI | 297561298 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.684667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.408649 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTGCAG ACCCGACCCC CGACTCCGCC CGGACCACCC TTTCGGAGCC GGTCACCATC TCGAAAATTG CGGAAGCCGC AGGCGTATCG GTGCCGACGG TCTCGAAGGT GCTGAACGGA CGCGCCGACG TGGCCAGCGA GACCCGTGCA CGCGTGGAGG AGCTGATCCG CCGCCACGGC TACCGGCGAC GCCGGGGCCC GGGCGGAGGC CGTTCCTCCA TGGTCGACCT GGTCTTCCAC GAACTCGACA GCGCCTGGGC GGTCGAGGTC ATCCGGGGCG TGGAGAGCGT GGCGCGCTCG GAGGGCCTCA GTGTCGTGCT GACCGAGTCC GGGGGCGCCC AGACCCCCCG CGACGACTGG GTCGACGCCG TGCTCGCCCG CCAGTCCACC GCCGTGATCC TGGTCTTCTC CGACCTGGCG CCCGAACAGC GCACGCGCCT GACCGCGCGC GGCATCCCCT TCGTCGTGGT GGACCCGGCG GGGGACCCGG GCCCGGACAT GCCCTCGGTG GGCTCGGCCA ACTGGAACGG CGGCCTGGTC GCCACCCGCC ACCTCATCGA GCTCGGCCAC CGGCGCATCG GCGTCATCGG CGGGCCCCGG CACGTGCTGT GCAGCAAGGC CCGCATCGAC GGCTACGCCT CCGCGCTGGA CTCGGCCGGA CTGACGGCCG ACCCCGCGCT CGTGCGCTAC GGCGACTTCC ACGTCGAGAG CGGGCGCGAC CGGGGCCGGG AGCTGCTCCG GCTGGACGAC CCGCCCACGG CCGTCTTCGC GGGCAACGAC CTCCAGGCCA TGGGCCTGTA CGAGGCCGCC CGCGAGCTGG GCGTGCGCAT CCCCGAGGAC CTGAGCGTGG TCGGCTACGA CGACCTGCCG GTGGCCCGCT GGATGGGGCC GCCGCTGACC ACCGTGCGCC AGCCGCTCAC CGAGATGGCC GAGGAGGCGA CCCGGATGGC GCTCACCCTG GCGCGCGGCG GCACCCCCGC CAACCTGCGC CTGGACCTGG CCACCGACCT GGTGGTCAGA CGCAGCAGCG CCCCGCCGCC CGCCTCCTGA
|
Protein sequence | MSADPTPDSA RTTLSEPVTI SKIAEAAGVS VPTVSKVLNG RADVASETRA RVEELIRRHG YRRRRGPGGG RSSMVDLVFH ELDSAWAVEV IRGVESVARS EGLSVVLTES GGAQTPRDDW VDAVLARQST AVILVFSDLA PEQRTRLTAR GIPFVVVDPA GDPGPDMPSV GSANWNGGLV ATRHLIELGH RRIGVIGGPR HVLCSKARID GYASALDSAG LTADPALVRY GDFHVESGRD RGRELLRLDD PPTAVFAGND LQAMGLYEAA RELGVRIPED LSVVGYDDLP VARWMGPPLT TVRQPLTEMA EEATRMALTL ARGGTPANLR LDLATDLVVR RSSAPPPAS
|
| |