Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2961 |
Symbol | |
ID | 8448574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 3245543 |
End bp | 3246673 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645042046 |
Product | protein of unknown function DUF21 |
Protein accession | YP_003202288 |
Protein GI | 258653132 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.192256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00460853 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCACCG AATGGTTGCT GCTCGGGCTG GTCGTCGTGC TGCTCGCGGC CAGCGCGTTC TTCGTCGCCG CCGAATTCGC CCTGATCGCG GCCCGGCGGA CCGTCGTCGA GCCGATGGCG GTGAACAGCG CCCGGGCCCG CTCCACGCTC AAGGCGATGG AGCACGTGTC GCTGATGATG GCCTGCGCGC AGCTGGGCAT CACCCTGTGC GGGGTGCTGC TCGGCGCCCT GGGCGAACCG GCCGTGGCCG CGCTGCTGGA ACCGGTGTTC CATGCCCTGG GCGTGCCCGA GGCCTGGCTG CACCCGGTCT CCCTGGTGAT CGCCCTGCTG CTGGTGGTCT CCGCGCACGT GGCCTTCGGC GAGATGGTGC CCAAGAACAT CGCCATCGCC GATCCGGAGC GCACCGCGCT GGCCCTTGCC CCGGCGCTGC GGGCGATCGC CACCGGCATC GGGCCGATCA TCCGGACGCT GAACTGGTTC GCCAACTCGG TGGTCAAGCT GACCGGCCGC GAACCCAAGG ACGAGGTGGC CTCGGCGTTC ACCCGCGAGG AGGTCGCCGA GCTGGTCGCC GAGTCCCGGC GCGAGGGGCT GCTGGACGCC GACGAGCACC AGCTGATCAC GTCCGCGCTG GGCTTCGACA CCTCGCTGGT CTCCTCGGTG ATGGTGCCGG TCACCGAGGT CGAATCGGTG GACGAGCAGG TCACCCCGGC CGAGATCGAG CGGATGTGCG CCCGGACCGG TTTCTCCCGC TTCCCGATCG ACCGGGTCGA CGACTCGGCC CGGGGCTTCG CCGGGTACCT GCACATCCGC GACGTCGTCG ACATCCCGGC CGACCGGCGG GACGAGCCGG TGCCGCCGGA ACGCATCCGG GAGCTGCCCG CGGTGGCGCC CGGCACCGAC CTGCGGACCG CGCTTGACCG GATGCGGCGC ATCGGCGCGC ACCTCGCGCA GGTCGTCGCA CCCGTGCCCG GGGCCGGCGA CGGGCCGGAC CTGGTGGCCG ACGCCGAGCG TCCGGTCCTG GGCGTGGTGA TGCTCGAAGA CGTCATCGAG GCCCTGATCG GCGAGGTCCA GGACGCGACC CGCCGCACCC CCGGGCGGTC GGCGCCACCG GGCGAGCGGT CCGAGGCATA A
|
Protein sequence | MTTEWLLLGL VVVLLAASAF FVAAEFALIA ARRTVVEPMA VNSARARSTL KAMEHVSLMM ACAQLGITLC GVLLGALGEP AVAALLEPVF HALGVPEAWL HPVSLVIALL LVVSAHVAFG EMVPKNIAIA DPERTALALA PALRAIATGI GPIIRTLNWF ANSVVKLTGR EPKDEVASAF TREEVAELVA ESRREGLLDA DEHQLITSAL GFDTSLVSSV MVPVTEVESV DEQVTPAEIE RMCARTGFSR FPIDRVDDSA RGFAGYLHIR DVVDIPADRR DEPVPPERIR ELPAVAPGTD LRTALDRMRR IGAHLAQVVA PVPGAGDGPD LVADAERPVL GVVMLEDVIE ALIGEVQDAT RRTPGRSAPP GERSEA
|
| |