Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_0473 |
Symbol | |
ID | 8446054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 523033 |
End bp | 524625 |
Gene Length | 1593 bp |
Protein Length | 530 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 645039607 |
Product | protein of unknown function DUF1152 |
Protein accession | YP_003199881 |
Protein GI | 258650725 |
COG category | [S] Function unknown |
COG ID | [COG4034] Uncharacterized protein conserved in archaea |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGAGAC CGCTCTACGT CGCCGCCGGT GGTGGCGGCG ACGCGCTCGC CGCCACCCTG CTGCACCGCG CCTACGGACC GCCCGGGCCG GCCACGATCG CCACCTTCTC CTGGGACCGG TTGATCGTGG ATCCGCTGCC GGGGCCGCGC AGCTTCTCCA ACTTCAAGGG CCTCGAGACG GTCGGTCGGC TTGAGCACGT CGTCACACCG CGCACTCGGC CCATCCCGCC GGCCGGCTCG ACCCTTCCAC CGTTGGCTCG GGACCTGGTC GGCCCGCTCG CCTCGACCCT CGTCCTTCTG GATCCCACCG ACGGCGCGGC TGGCCTGCGC GAGCAGTTGG CTGCCAGCAT GCAAGCGATC GACGCGGATG CGCTGGTTGT CGTCGACGTC GGCGGCGATG TGTTGGCGAC CGGTAAGGAA GCCGGCCTCC GTAGCCCGCT CGCGGACGCT CTCGTGCTCG CGGCCGCTCG TGGGCTCAGT CCCGACGCCC GAGTTTGGGT CGCCGGCCCC GGGGTCGACG GCGAGCTCAC CGCCGACGAC GTCGTCTCGC GAGCGCACTC AATCGGCGGG GTCCCGCTTC CACCCTTCCC CTCCGACGTT GCGGCACTGG CTCTTCCTAT CCTCCGATGG CACCCCTCCG AGGCAACCGC GCTATTCGTG GCGGCCGCCC AGGGCGTCCG CGGACTTGTC GACATCAGAT CCGGCGGCAT GCCTGTTCAG CTTGGCGCGG TCAGCTCCGA CGTCTACGAA TGCGGAGTCG ATAGTGCCTT CGAGGTTTCG CCGCTCGCGG ATTCGGTCGC CGACTCCAGA ACTCTGCTCG ACGCCGAGCA GAGGGCTATT GAGATCTGCG GGATCTCGGA GATCAGGTTC GAGGCGCGGA AAGCTGGGGC AGCAAGACTG CGCGACGGGT TTCCACCCGA TGTGCTCGAT GAAATACGTG CCTATGCTGC GGAAGCGCTA GGTCAAGGAG TTACCTACGC GACCTTCCGC CGGCTGGCCG AGCTGATCAG AATCCGCGAT CACGCGTCGA TTCAACGCAA TCTCGGCCTG AACCTTCCCG GCGCTATTGA ATCAACATTG TGCAATCTGG CGGGTTTGAC GTCATCCGGT TCGGCGGTTT GCCTCCCGGC CCTGCCTCGC CCGGCCGCTG GCAGGGCTTC GATGGATGAC TTGCCGCGTC ACTCCGTGTC CGTGGCCGGC ATCATTATCG ACGTCGAGGG CCGAATCCTG GTCGTCAAGC GTCGTGACAA CGGCGAATGG CAGCCGCCTG GTGGCGTCCT CGAGTTGGAC GAAACGATCG AGGAAGGGCT GCGGCGTGAG GTCCATGAGG AAACGGGAAT CGACGTCCAC ATCGACCGCC TTACCGGTGT GTACAAGAAC ATGCGCCTTG GTGTCGTAGC GCTCGTCTTT CGATGTCGAC CGAGCGCTGG CTCGCTCCAG GCAAGTTCCG AAACAGAGGT GGCTCGTTGG ATGTCCGCAC AAGAAGTCGA GTCCACCTTG TCGCCTGCAT TCGCCATCCG TGTCCGCGAC GCCATCGGCG AAGCTGCCTT TGTCGCGATT CGGTATCACG ACGGCACTGG GGACGTTCCC TGA
|
Protein sequence | MERPLYVAAG GGGDALAATL LHRAYGPPGP ATIATFSWDR LIVDPLPGPR SFSNFKGLET VGRLEHVVTP RTRPIPPAGS TLPPLARDLV GPLASTLVLL DPTDGAAGLR EQLAASMQAI DADALVVVDV GGDVLATGKE AGLRSPLADA LVLAAARGLS PDARVWVAGP GVDGELTADD VVSRAHSIGG VPLPPFPSDV AALALPILRW HPSEATALFV AAAQGVRGLV DIRSGGMPVQ LGAVSSDVYE CGVDSAFEVS PLADSVADSR TLLDAEQRAI EICGISEIRF EARKAGAARL RDGFPPDVLD EIRAYAAEAL GQGVTYATFR RLAELIRIRD HASIQRNLGL NLPGAIESTL CNLAGLTSSG SAVCLPALPR PAAGRASMDD LPRHSVSVAG IIIDVEGRIL VVKRRDNGEW QPPGGVLELD ETIEEGLRRE VHEETGIDVH IDRLTGVYKN MRLGVVALVF RCRPSAGSLQ ASSETEVARW MSAQEVESTL SPAFAIRVRD AIGEAAFVAI RYHDGTGDVP
|
| |