Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2082 |
Symbol | |
ID | 8447692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2295644 |
End bp | 2296834 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 645041204 |
Product | hypothetical protein |
Protein accession | YP_003201449 |
Protein GI | 258652293 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00946987 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.00350075 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTGACAG AGACTCACCA CCTGGATATC AAGCGCGAGA TCGGCGTCAA ACCGGGCGAC CGGGCCGAGC TGGCGCGCGA CCTCGCTCAG TTCGCCATCG ATGGTGGCGC TGTCATCGTC GGCGTCGAAG AGGACAAGGC CACAAGAACT TGGACGCTGA CACCACAGAA GTTGGTGGGC CTGCAGGAGC GCATCGAGCA GATCGCGAAC AGCCAGATCG ATCCTCCGCT GTCGATCGCG ATCCGCGTAC TGGCGCTTGC GGAAGACCCA ACTCAGGGCT ACGTCGTGAT CCAGGTCCCA GTCTCGCCGC GGGCACCCCA CATGGTCGGT GGCATCTACT ACGGCCGTGG TGAGACCCGT CGCAATAGGC TATCCGACGC GGAGGTAACG CGGTACCACG CCGCACGTCG CGACACGACG AGCCGGATCG AGGACCTGCT GCAGGCCGAG ATCGACCGGG ATCCGATCAT CGCCGCTGGG CAGGCACGGC GCGGTCACTT GTATCTGGTC GCGCAGCCCG AAATCGACCA CGACGAACTT GCTCTTACAC TGCTTGAAAG CGCGCAAGGG CCGATGGCGG AACCGTACCG CACGATCACA GCTGGAGCCG AGCAGTTCGT CCACCAGAGT GTTCGAATCT ATGAACCGTC ACCTGGCTAT GCGTCATCTA TCGCGATACG GTCGACGGGG CGTGCGTACT GCAGTCAAAC GCTCTCGGAT GGCCGACGAT ACCGACCCGA CGGTGATTTC GAGAGTGATA CTGTCGATAT CGAAGTGCAC GAAGACGGTG GGATACGGGC CATGGTCGGC CGCATGACCG AGGAATATGC TGGTCGCAAC AGTCGAAGCA CGCCAGAGTC AGTCATTTTC GACGGCCTCG CAGTCGCATA TGCGGTCCGG CTGGTTCATT GGGCGCGGAT GCTATCAGTA CTCACTGGGT ACCACTCGGG GTGGCTGTTC GGTATCGCAG CAACCGGCCT CGAAGGCCGC CGCAGTCTTG TCTGGGCTCA GAGATTTCCA CCCCGAGGCC CGCACTACGA CCAAGACAGC TACCGGCGAA CGACAACGGC GACACTCACC GAAATGACCG AACATCCGGG AGTCGTAGCT AAAAGGCTCA TTGGAGCATT ACTCCGCGGA CTCGGGACGA CCGAAGAGTT TCAGGAGGCG TTCAGCCAGC CACAGAATTG A
|
Protein sequence | MLTETHHLDI KREIGVKPGD RAELARDLAQ FAIDGGAVIV GVEEDKATRT WTLTPQKLVG LQERIEQIAN SQIDPPLSIA IRVLALAEDP TQGYVVIQVP VSPRAPHMVG GIYYGRGETR RNRLSDAEVT RYHAARRDTT SRIEDLLQAE IDRDPIIAAG QARRGHLYLV AQPEIDHDEL ALTLLESAQG PMAEPYRTIT AGAEQFVHQS VRIYEPSPGY ASSIAIRSTG RAYCSQTLSD GRRYRPDGDF ESDTVDIEVH EDGGIRAMVG RMTEEYAGRN SRSTPESVIF DGLAVAYAVR LVHWARMLSV LTGYHSGWLF GIAATGLEGR RSLVWAQRFP PRGPHYDQDS YRRTTTATLT EMTEHPGVVA KRLIGALLRG LGTTEEFQEA FSQPQN
|
| |