Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_1863 |
Symbol | |
ID | 8447470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2046483 |
End bp | 2047364 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645040993 |
Product | domain of unknown function DUF1731 |
Protein accession | YP_003201241 |
Protein GI | 258652085 |
COG category | [R] General function prediction only |
COG ID | [COG1090] Predicted nucleoside-diphosphate sugar epimerase |
TIGRFAM ID | [TIGR01777] conserved hypothetical protein TIGR01777 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.508157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00330545 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAGGTTG TCGCCGCCGG GGTGAGTGGG TTCCTGGGGA CCCGGCTGAC CAACGCGTTG ACCGAGGCCG GCCACACCGT GGTCCGTCTG GTGCGCTCCG AGCCCACCGG TCCGCACGAG AGTCTGTGGA ACCCGCACTC GGGTGATCTG GACAGCTCCG TGTTCGACGG AGCCGACGCG GTGGTCAACC TCTGTGGAGC TCCGACCGCC TTCCATCGCT GGACCCAGGA CTACAAGCAC CTGCTGCTGA CCTCCCGGGT CAACCCGACC CGCGTGCTGG CCGCCGAGTG CGCGCGGCTG GGCGTCCCGG TCCTGCTCAA CGCGTCCGCG GTGGGCTACT ACGGCGCCCG AGGTACCGAG ATCGTCACCG AGCACACCGG CCCAGGCGAT TCCTTCCTGG CCGACCTGTG CGTGCAGTGG GAGGCGGCCA CCGCCGCGGC GGAGGCGGCC GACGTGCGGG TGGTGCACCT GCGCACCGGG CTGGTGCTGG GCATGGACGG CGACCTGCTC AAGATCATGT CGCTGCTGAC CCGCCTGTGG GCCGGCGCCC GGCTGGGATC GGGCGAGCAG TACTACCCGT GGATCTCGGC GACCGATCAC TTGGCCGCCA TGCTGTTCCT GCTCACCCAC CCCGTGCACG GGCCGGCCAA CCTGACCGCG CCCTACCCGG TGACCAATGC GGAGTTCACC AAGGAGCTCG GGCGGGCGCT GCACCGGCCC ACCCCGTGGG TCGTGCCGGA ATTCGCCATC CGCGCCCTGG TCGGCGAGTT CGCCGACGAG CTCGTCGACG GCCGGCGGGC GGTCCCGGCC GCCCTGCACG ACGCCGGTTT CCAGTTCGCC CACCGCACCC TGCCCGAGGC GCTGGCCGCC GAACTGCGGT GA
|
Protein sequence | MKVVAAGVSG FLGTRLTNAL TEAGHTVVRL VRSEPTGPHE SLWNPHSGDL DSSVFDGADA VVNLCGAPTA FHRWTQDYKH LLLTSRVNPT RVLAAECARL GVPVLLNASA VGYYGARGTE IVTEHTGPGD SFLADLCVQW EAATAAAEAA DVRVVHLRTG LVLGMDGDLL KIMSLLTRLW AGARLGSGEQ YYPWISATDH LAAMLFLLTH PVHGPANLTA PYPVTNAEFT KELGRALHRP TPWVVPEFAI RALVGEFADE LVDGRRAVPA ALHDAGFQFA HRTLPEALAA ELR
|
| |