Gene Information |
Locus tag | Namu_1642 |
Symbol | |
ID | 8447241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 1808703 |
End bp | 1810037 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645040765 |
Product | hypothetical protein |
Protein accession | YP_003201021 |
Protein GI | 258651865 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4584] Transposase and inactivated derivatives |
TIGRFAM ID | |
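The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Namu_1642.
length = gene_length(1808703, 1810037)
print(length)                  # 1335
print(protein_length(length))  # 444
```

Both results agree with the Gene Length (1335 bp) and Protein Length (444 aa) fields listed in the record.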
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.00443318 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00160164 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCACAC AGGAGGAAGA CGTGGACGCG CACGCCCTGC GCCGCAGGGG GTGGTCGATC TCCGCGATCG CCCGCCACCT GGGCAAAGAC CGAAAAACGA TCAGGGCGTA CTTGAACGGT GAGCGGACCG CCGGAGTCCG CTCACCGGCC GGCCCGGACG TGTTCGAACC GTTCGTGACC TACTGCCGGG AACGGCTGGT CGAGGACCCG CACCTATGGG CGACCGCCTT GTACGACGAG TTGCTGCAGC TGGGTTACGA CCGGTCCTAC CCGCGGCTGA CGCACAACCT GCGCACTCGT GGGCTGCGGC CGGTGTGCCA CGCCTGCCGG CCGGCCGGCG GCCGCCCCGC AGCGGTCATC GACCACCCGC CGGCCGAGGA AACCCAATGG GACTGGCTCG AGCTGCCCGA CCCACCCAGG AGTTGGGACG GCTACGGAGC CAAGGCGTTC CTGCTGGTCG GCGCGCTGGC CCACTCGAGC AAGTGGCGGG GGGTGCTGGC CGAGGCGATG GACCAGCCGC AGCTGATCGA CGCGCAGCAC CAGGTCGTGG TCCGGCTCGG CGGGCTGACC CGGGTCTGGC GGTTCGACCG GATGGCCACC GTCGTGCACC CGGGCACCGG CAAGGTCACC GCCTCTTACG CCGCGGTCGC CAAGCACTAT GGGGTACAGG TCAAACCGTG CCCACCGCGG CGGGGCAACC GCAAGGGTGT GGTCGAGAAG GCCAACCACA CCGCGGCGCA ACGATGGTGG CGCACCCTGC CCGACGACAT CACCGTCGCC GACGCGCAGG CCCGGCTCGA CCAGTTCTGC GCCACCGTCG GCGACACCCG CGACCGCGTC GATGTCGACG GGAACCGGTG CACCGTCGCG GACCTGGCCG CCCGGGAACG GCTCGCCCCG CTACCCGCCA CGGCGTTCCC CGCCGAGCTG GCCGTCGAGC GGGTCGTATC AGCGCAGGCG TTGGTCTCGT TCCGCGGGAA CCGGTACTCC GTGCCGCCCG AGCTGGCCAA CGGGCCGGTC ACCGTGACCC ACCGCCTCGG CTCCCCGGTG CTGGCGTTCG TCACCGACCG CGGCGTGACA GTAGCCCTGC ATCACCGGGC CGGCGACGGC ACCGGCGCCA CTATCCGCAG CGAGCACCAC GTCACCGCAC TGAACAAAGC CGCGCTGGCC GCGTTCACCA CCGCCAAACC GCACCGCCGC AAACAGCGCA TCCCACCCGG CCCGGCCGCC CGGCACGCCG CACAGGTGTT GCGCGGCGCC ACCACCACAC CCGACCCCGT CGTCGATCTG ACCGTCTACG CGGCCGCGGC CGCCCGCCGG AGGAACCTGC CATGA
|
Protein sequence | MLTQEEDVDA HALRRRGWSI SAIARHLGKD RKTIRAYLNG ERTAGVRSPA GPDVFEPFVT YCRERLVEDP HLWATALYDE LLQLGYDRSY PRLTHNLRTR GLRPVCHACR PAGGRPAAVI DHPPAEETQW DWLELPDPPR SWDGYGAKAF LLVGALAHSS KWRGVLAEAM DQPQLIDAQH QVVVRLGGLT RVWRFDRMAT VVHPGTGKVT ASYAAVAKHY GVQVKPCPPR RGNRKGVVEK ANHTAAQRWW RTLPDDITVA DAQARLDQFC ATVGDTRDRV DVDGNRCTVA DLAARERLAP LPATAFPAEL AVERVVSAQA LVSFRGNRYS VPPELANGPV TVTHRLGSPV LAFVTDRGVT VALHHRAGDG TGATIRSEHH VTALNKAALA AFTTAKPHRR KQRIPPGPAA RHAAQVLRGA TTTPDPVVDL TVYAAAAARR RNLP