Gene Information |
Locus tag | Namu_2626 |
Symbol | |
ID | 8448238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 2877594 |
End bp | 2878604 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645041722 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_003201965 |
Protein GI | 258652809 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
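The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the protein length excludes the terminal stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information table for Namu_2626.
START_BP = 2877594
END_BP = 2878604
GENE_LENGTH_BP = 1011
PROTEIN_LENGTH_AA = 336


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # True
print(computed_protein_length == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the fields agree: 2878604 - 2877594 + 1 = 1011 bp, and 1011 / 3 - 1 = 336 aa.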
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00000849178 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00271101 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCGGGG CGAGCGGAGC AGGGCTACCG GTCATCGTGT TCGATGCGCA GCCGACGTCC GCGGAGGAAC AGTTCGAGCT GTTCCACGAC ACGACCGCGC CGGTGTTCGA CACGCTCCCC TCGGGGAGCC CAGCCGACTT CTCTGCCCAA GCGACGGACT ATCTCGTGGG TGACGTGGTC ATCAGCCGGA TCGCGCACGC ACCGCAGAGC ATGCGGCGCA GCATCCGCCA TATCCGCTGC GGCAGCGAGG ACGCGCTCGC AGTCCTCGTC TACCGCCGAG GGCGTGTCGA CCTCAGCTTC GACCGCACCG AGATGACCCT GGACTCGCAG CACGTCGGCA TCATCGACCT CGCTCGATCC TTCTACGCGG CCTGCACCGA CATCGATTCG GTCTGGGCGG TCATCCCTCG TCGTCGCCTC CGTGCTTCCC TGGGGCGATC GCCGTGCGCG CGACTGCACA GGGACTCGCC CCGCGGGCGG GTGCTGCGCA GCACGGTCAT ATCCGTCTGG AACAGACTTC CGAACGCTTC CGCGGAAGAC GCAACGACTC TGGCGCAGGA AATCATCGAC GCCACCCAAT CGGTGCTCAC CGACGGCGAC TTCGCGCCTT CCGACACCGC TCTAGCAGTG GCGATGAGCG ACTTCGTCAT CGCGCACCTG GATGATCTGG ACCTCGACGC GCGCATGCTC GCCCGCACGT TCCACTGCTC GCGGTCGACG CTTTTCCGGA TCTTCGCACC GCATGGCGGT GTCGCCGCCT ACATCCGCGA CGCGCGGCTG GACCGTTGCC TCGACGAGCT GCTCGAACCG TACGAGTCAA CCCGCACGGT CCACCAGATC GCGACCAGAT GGGGATTTGA GAACCCGAGT CATTTCCACC GACTTTTCAC CACGCGCTAC GGAACTCCGC CATCCACAGC GCGCGGCACA CGCCACGCAC CGCCCGGTCG CGCCTACGAC CAAGACACGA GCAAAAAGAT CAACACGTTC CATCAATGGG CCACCCGGTG A
|
Protein sequence | MRGASGAGLP VIVFDAQPTS AEEQFELFHD TTAPVFDTLP SGSPADFSAQ ATDYLVGDVV ISRIAHAPQS MRRSIRHIRC GSEDALAVLV YRRGRVDLSF DRTEMTLDSQ HVGIIDLARS FYAACTDIDS VWAVIPRRRL RASLGRSPCA RLHRDSPRGR VLRSTVISVW NRLPNASAED ATTLAQEIID ATQSVLTDGD FAPSDTALAV AMSDFVIAHL DDLDLDARML ARTFHCSRST LFRIFAPHGG VAAYIRDARL DRCLDELLEP YESTRTVHQI ATRWGFENPS HFHRLFTTRY GTPPSTARGT RHAPPGRAYD QDTSKKINTF HQWATR
|
| |
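The Gene sequence and Protein sequence rows are printed in space-separated blocks of ten characters. A minimal sketch of how they could be processed follows: it strips the spacing, computes GC content, and translates the CDS. The helper names are hypothetical. The codon table is the standard one, which translation table 11 shares for amino-acid assignments (table 11 differs in its permitted start codons, which this sketch does not model).

```python
# Standard genetic code laid out in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last three blocks of the Gene sequence row above.
head = "ATGCGCGGGG CGAGCGGAGC AGGGCTACCG"
tail = "CATCAATGGG CCACCCGGTG A"

print(translate(head))  # MRGASGAGLP
print(translate(tail))  # HQWATR
```

The two fragments reproduce the first ten and last six residues of the Protein sequence row. Passing the full Gene sequence row to `gc_content` gives a value that can be compared with the GC content field in the table.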