Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3944 |
Symbol | |
ID | 8449563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4353228 |
End bp | 4354955 |
Gene Length | 1728 bp |
Protein Length | 575 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645042989 |
Product | protein of unknown function DUF885 |
Protein accession | YP_003203225 |
Protein GI | 258654069 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.26296 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACCGA CTGAGCCGGA GAGCCCCTCA CCGATCACCG AACTGTCCAA CGCCTACGTG GCCGAGTACG CGCGCCGCCG GCCGATCATT GCGACCTATA TCGGCCTGCC CGTCGCCCAG GACCGGCTGG ACGATCTGAG CCCGGCCGGG CTGGCCGACG GGTACGAGTT CACCACCGCG ACCGCGCGTC GCCTGGCCGA GCTGCCCTCC ACCGGGCCGG CCGACGACAT CGCCCGGGAG GTGCTCGCCG AGCGGCTCGA GGTCGACGCC GACCGGTACC GCTCCGGCTG GGCCCACGCC GACCTGAACG TCCTCGCCTC GCCGCTGCAG GCCGTGCGCG AGGTGTTCGA TCTGATGCCG ACCGACACCA CCGCCGACGT CGAGACCATC GCCCGGCGGA TGGCCGTGGT GCCGGCCGCG CTGCTGGGGT ACCGGCAGAG CCTGCTGCAG GCGGCCGAGA ACGGCCAGGT CGCCGCCGTC CGGCAGGTCG ACCGGTGCGC CGAGCAGTGC GACGTCTACT CCGGCCGCAC CGCCGAGCGG GGCTTTTTCG CCGGCCTGGC CGGCACCCTC ACCGCGGGTC CGGACGGCTC CACCGCGGTG TCCGGGGAGT TGGCCACTGA ATTGGCTACT GAGTTGGCCG CGGCTGCGGC TGCCGCCGAC CAGGCGTACG CCGAGCTGGG CGACTTCCTG CGGACCGAGC TGCGCGAGCG GGCGCCGGCG AAAGACGCGG TCGGCCGGGA GCGGTACGCG TTGGCCTCCC GCGACTTCCT CGGCGCGGTC ATCGACCTCG AGGAGACCTA CCAATGGGGC TGGTCGGAAT TCCTCACCAT CGAGGCCGAA CTGCGGGCGG TGGCCGAACG GATCGCCCCG GGTGAGGGAC CGGCCGGGGC GGCCGCCGCG CTGGATCGCC ACCCGGCCCA CCAACTGTCC GGGGTGCCGG CGCTCAAGGC CTGGATGCAG GACCTGTCGG ACCGGGCGAT CGATGAGCTG GGCCGCACCC ATTTCGACAT CCCCGAGCCG ATCCGCCGGC TGGAATGCCT GATCGCCCCG CCCGGCGGCA TCGTCGGCGC CTACTACACC GGCCCCAGCG ACGACTTCAG CCGGCCCGGC CGGATGTGGT GGGGGGTCGA GCCCGGGCGC GAGGTGTTCA ACACCTGGCG TGAGGCCAGC ATCGTCTACC ACGAGGGCGT GCCCGGCCAT CACCTGCAGA TCGCCACCTC GGTCTACCGC CGGGACCGCC TCAACGACTT CCAGCGGCTG CTGGCCGACT ACTCCGCGCA CGCCGAGGGC TGGGCGCTGT ACGCCGAGCG GCTGGTCCGC GAGCTGGGCT ACCTGGCCGA CGACGGGCGG CTGCTGGGCC TGCTGGACTC TCAGCTGTTC CGCAGCGCCC GGGTGGTCCT GGACATCGGC ATGCATCTGG AGCTGGAGAT CCCGGCCGGG ACCGGGTTCC ACGAGGGGGA GCGGTGGACC CCCGAACTCG GCCTGGAGTT CCTGCTGACC AGGACCGTCA CCGATCCCGC GCACTGCCGG TACGAGATCG ACCGCTACCT GGGCTGGCCC GGCCAGGCGC CCGGCTACAA GGTCGGCGAA CGGGTATGGC TGGCCGGCCG GGACGCCGCC CGCCGCCGGC ACGGCGACGC GTTCGATCTG CGGGCCTTCC ACACCGATGC GTTGAACATG GGCGCGATGG GGCTGGACGT GCTGGCCCGC CGGCTGGCCC TGCTGTAG
|
Protein sequence | MSPTEPESPS PITELSNAYV AEYARRRPII ATYIGLPVAQ DRLDDLSPAG LADGYEFTTA TARRLAELPS TGPADDIARE VLAERLEVDA DRYRSGWAHA DLNVLASPLQ AVREVFDLMP TDTTADVETI ARRMAVVPAA LLGYRQSLLQ AAENGQVAAV RQVDRCAEQC DVYSGRTAER GFFAGLAGTL TAGPDGSTAV SGELATELAT ELAAAAAAAD QAYAELGDFL RTELRERAPA KDAVGRERYA LASRDFLGAV IDLEETYQWG WSEFLTIEAE LRAVAERIAP GEGPAGAAAA LDRHPAHQLS GVPALKAWMQ DLSDRAIDEL GRTHFDIPEP IRRLECLIAP PGGIVGAYYT GPSDDFSRPG RMWWGVEPGR EVFNTWREAS IVYHEGVPGH HLQIATSVYR RDRLNDFQRL LADYSAHAEG WALYAERLVR ELGYLADDGR LLGLLDSQLF RSARVVLDIG MHLELEIPAG TGFHEGERWT PELGLEFLLT RTVTDPAHCR YEIDRYLGWP GQAPGYKVGE RVWLAGRDAA RRRHGDAFDL RAFHTDALNM GAMGLDVLAR RLALL
|
| |