Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3422 |
Symbol | |
ID | 8449037 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3761438 |
End bp | 3762433 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645042499 |
Product | transcriptional regulator, DeoR family |
Protein accession | YP_003202739 |
Protein GI | 258653583 |
COG category | [K] Transcription |
COG ID | [COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00167672 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0000946646 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGGATC GTGCCGACCC GCCTCGGCTG CTGGTCAAGG TCGCGCGGCT CTACCACGGG CGGGGACTTC GGCAAAGTGA GATCGCCGAA CGGCTGCGCA TCTCGCAGGC CCGGGTGTCC CGTTTGCTGC AGCAGGCCGA GGACCTGGGC ATCGTGCGGA CCGTCGTGGT CCTGCCGCCC GGCCTGAACA GCGAGCTGGA AGACGAGATC GAGCACCTCT ACGGGGTCCG CCAGTGCCAC GTGGTCGACG CGGTGGCCGA GGGCGAGGAC GAGCTGGCCG AGGACCTCGG CCAGGCGATG GCCGCCATCG TCGGTGGCGG CGGCATCCTG ACCTCCTCGA CCTCGGTCGT CGGCTACAAC TCGTGGAGCC GCACCCTGCA GGCGATGGTG GCCGCGCTGC AACCGATCCG CGGCGGCGCC GGCCGGGTGG TCGAGATGCT CGGCGACATC GGGCCGCCCG CGCTGCAGCA CGAGGCGGCC CGGTCCACCC AGCGGCTGGC CGCGCTGACC GGGGCCGAGC CGGTCTACCT GCGCACGCCC GGGGTGAGCT CGACGCCGCA GATGCGCGCG GCCATCCTCG ACCAGGACCG GCACGCCCGG GAGACGCTGG CCCTGCTGGA CCGGATGGAC GTCGCCCTGG TCGGCGTCGG CGAGTGCGAC GTGGTGGCCC CGCTGCGCCC CGGCGACAAC TTCTTCACCC AGGAGCAGTT CGACCGGGCC CGGGAGCTGG GCGCGGTCGG GCAGATCTGC CTACGGTTCC TGGACGAGCA CGGCCACGAG GTGGCTACCG AGTTCGACGA CCTGGTGATC GGGGTGAGCC TGGATCAGCT GCGGGTCGCC GCCCACCGGT GCGCGGCCGC CGGCGGGCCG GCCAAGTACC GGGTGATCCG GGCGGCCCTG CTCGGCCACT GGGTGGACAC GCTGGTGGTG GACACGGCGA CGGCCGATTG GCTGGTCGCG GCCGGACCGG GCGAGGCGGC GGCCGTCAGT CGGTGA
|
Protein sequence | MPDRADPPRL LVKVARLYHG RGLRQSEIAE RLRISQARVS RLLQQAEDLG IVRTVVVLPP GLNSELEDEI EHLYGVRQCH VVDAVAEGED ELAEDLGQAM AAIVGGGGIL TSSTSVVGYN SWSRTLQAMV AALQPIRGGA GRVVEMLGDI GPPALQHEAA RSTQRLAALT GAEPVYLRTP GVSSTPQMRA AILDQDRHAR ETLALLDRMD VALVGVGECD VVAPLRPGDN FFTQEQFDRA RELGAVGQIC LRFLDEHGHE VATEFDDLVI GVSLDQLRVA AHRCAAAGGP AKYRVIRAAL LGHWVDTLVV DTATADWLVA AGPGEAAAVS R
|
| |