Gene Information |
Locus tag | Namu_2958 |
Symbol | |
ID | 8448571 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3241858 |
End bp | 3242838 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645042043 |
Product | hypothetical protein |
Protein accession | YP_003202285 |
Protein GI | 258653129 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.000014666 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00431859 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGCCG GCACCGCCAA CGAGAACGCC CAGGACCGGC TGGCCCGGAT GCTGTCACTG GTGCCCTACA TCTCCCGCCG GCCCGGGGTG AGCATCGCCG AGCTGGCCCG CGAGTTCGCG GTCAGCCGCG CGCAGATCAG CGCCGACCTG GACCTGCTGA TGGTCTGCGG GCTGCCCGGC TACTACCCCG ACGACCTGAT CGACGTCGTC CTGGACGACG ACGGCGGCAC AGTCTCCATC ACCTTCGACG CCGGCATCGA GCGGCCGGTC CGGCTCACCG CCGACGAGGC CCAGGCCCTG CTGGTGGCGC TGCGGGCGCT GGCCGAGACC CCCGGCCTGG TGGAGACCGA CGCCGTCGAC TCGGCGCTGG CCAAGCTCGA GCAGCTGGAC CGGCAGGCGG CCGAATCGAC CGGCGCCGTC CGGGTGGTGG CGGCCGACCC GGCCCCGCAG CTGGGCACCG TGCGCGACGC CCTGGACCGG TCCCGGCGGT TGTGGATGCG GTACTACACC GCCTCCCGGG ACACCGTCAC CGAACGCACC GTCGACCCGT TGCGCATCCT GGTCACCGAC GGGCACGCCT ACCTGGAGGC CTACTGCCAC CTGGCCCGGG CCATCCGGCA CTTCCGCATC GACCGGATCG AGGCGGCCCA GGTGCTGGAC GAACCGGCCC AGGGCAGCCT GTGGGTCGAC TCGCAGGTGC CCGACCGGAT CTTCCACCCC GACCCGGCCG TGCCGCCGGT CGAGCTGGTC CTGACCGGAG GAGCGCGTTG GGTGGCCGAG TACTACCCGG TCGAGACGGT CACCCCGTTG CCGGCGTCGC AGGCCGGCGT CCGGGTCACC CTGCACGCCG GCAACGACGA CTGGCTGACC CGGCTGGTCC TCTCGCTCGG TGGGGACGCG GTGGTGACCA ACCGGCCCGA GGTGACCGAG CTGGTCGCCC GGCGGGCCGA ACAGGCCCTG GCCGCCTACC AGAGCGACTG A
|
Protein sequence | MSAGTANENA QDRLARMLSL VPYISRRPGV SIAELAREFA VSRAQISADL DLLMVCGLPG YYPDDLIDVV LDDDGGTVSI TFDAGIERPV RLTADEAQAL LVALRALAET PGLVETDAVD SALAKLEQLD RQAAESTGAV RVVAADPAPQ LGTVRDALDR SRRLWMRYYT ASRDTVTERT VDPLRILVTD GHAYLEAYCH LARAIRHFRI DRIEAAQVLD EPAQGSLWVD SQVPDRIFHP DPAVPPVELV LTGGARWVAE YYPVETVTPL PASQAGVRVT LHAGNDDWLT RLVLSLGGDA VVTNRPEVTE LVARRAEQAL AAYQSD
|
| |
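The length and composition fields in this record can be cross-checked against the coordinates and the gene sequence. The following is a minimal sketch, not the database's own method: the function name `cds_summary` and its return keys are hypothetical, and it assumes an unspliced CDS whose sequence ends in a single stop codon (as the TGA here suggests), so that the protein length is the codon count minus one.

```python
def cds_summary(start_bp: int, end_bp: int, gene_seq: str) -> dict:
    """Derive length and GC figures for an unspliced CDS (illustrative helper)."""
    # The record prints the sequence in space-separated blocks of 10 bases.
    seq = "".join(gene_seq.split()).upper()
    if not seq:
        raise ValueError("gene_seq is empty")

    # Coordinates are inclusive, so the span is end - start + 1.
    span_bp = end_bp - start_bp + 1

    # Assumption: the final codon is a stop codon and is not translated.
    protein_aa = len(seq) // 3 - 1

    gc_count = seq.count("G") + seq.count("C")
    gc_percent = round(100 * gc_count / len(seq))

    return {
        "span_bp": span_bp,
        "seq_bp": len(seq),
        "protein_aa": protein_aa,
        "gc_percent": gc_percent,
    }
```

With the coordinates listed above (start 3241858, end 3242838) the span is 981 bp, matching the Gene Length field, and 981 / 3 - 1 = 326 matches the Protein Length field. Passing the full gene sequence from the Sequence section allows the same comparison for `seq_bp` and `gc_percent`.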