Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_2858 |
Symbol | |
ID | 4611047 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | + |
Start bp | 2977991 |
End bp | 2979406 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639792523 |
Product | hypothetical protein |
Protein accession | YP_938842 |
Protein GI | 119868890 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.860129 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCG CATACACCCT GCTCTCGCTG TTGGCGATCG TGGTGTTGAC CGCGGGTACC GCGGTGTTCG TCGCCGCCGA GTTCTCGTTG ACGGCGCTGG AGCGCAGCAC CGTCGAGGCC AATGCCCGTA GCGGGCACCG GCGCGACCAG CTCGTGCGCC GCGCCCACCG CACGCTGTCC TTCCAGCTCT CGGGCGCCCA GGTGGGCATC TCGATCACCA CGCTGGCCAC CGGCTACCTG GCCGAACCCG TGGTCGCCCG GCTGCTGCAG CCGGGTCTGG ACGCGATCGG ACTGCCGGAA CAGGCCGCGA GCGGCGTCGC CCTGTTCCTC GCGATCCTGA TCGCCACCTC CCTGTCGATG GTCTTCGGCG AACTCGTGCC GAAGAACCTC GCGGTGGCCC GCCCGGCGCC GACCGCCCGC GCAGCCGCAC CTCCGCAGCT GCTCTTCTCG ACGATCTTCA CGCCGCTGAT CCGGCTCACC AACGGCACCG CGAACATGAT CCTGCGCCGG CTGGGCATCG AACCCGCCGA GGAACTGCGC TCCGCGCGGT CGGTGCAGGA GCTGATCTCG CTGGTGCGCA ACTCCGCGCG CAGCGGTTCA CTCGACCCGG TGACCGCGGT GCTGGTGGAC AGGTCACTGC AGTTCGGCGA GCGCACCGCC GAGGAACTGA TGACCCCCCG CACCGAGATC GAGGCCCTGC AGGCCGACGA CACCGTCGCC GACCTCATCG CCGCGGCGAT CGAAACGGGG TATTCGCGCT TCCCGATCGT CGAGGGTGAC CTCGACGAGA CCATCGGCGT CGTCCACGTC AAACAGGTGT TCTCGGTACC GCGCGACGAC CGCGACCGCA CCCGCCTCGC GGCAATCGCG ATCCCGGTGG CCACCGTGCC CTCGACGCTG GACGGGGACG CGGTGATGAC CCAGATCCGC GCCAACGGGC TGCAGACCGC GCTGGTGGTC GACGAGTACG GCGGCACCGC CGGCATGGTG ACCGTCGAGG ATCTGATCGA GGAGATCGTC GGCGACGTCC GCGACGAACA CGATGACGCC ACCCCCGACG TGGTCGCCGC CGGCGACGGC TGGCAGGTGT CGGGCCTGCT GCGGATCGAC GAGGTGGCCA CCGGGACCGG TTTCCGCGCC CCCGAGGGCG AGTACGAGAC CATCGGCGGG CTGGTGCTGC AGGAGCTCGG ACACATCCCG GAAGTGGGCG ACTCGGTCGA GCTGACCGCG TTCGATCCGG ACGGGCCGCT CGACGATCCG ATCCGCTGGC AGGCCAAGGT CGTGCAGATG GACGGTCGCC GGATCGACCT TCTGGAGTTG GTCGAACTCG GGCGCCGCGG CGACACCGAC GACGACCACA TCGACAACGA CGACCACCAC AACAAAGACG GCGCCGCGCC GGAGGAGGAC CGCTGA
|
Protein sequence | MSVAYTLLSL LAIVVLTAGT AVFVAAEFSL TALERSTVEA NARSGHRRDQ LVRRAHRTLS FQLSGAQVGI SITTLATGYL AEPVVARLLQ PGLDAIGLPE QAASGVALFL AILIATSLSM VFGELVPKNL AVARPAPTAR AAAPPQLLFS TIFTPLIRLT NGTANMILRR LGIEPAEELR SARSVQELIS LVRNSARSGS LDPVTAVLVD RSLQFGERTA EELMTPRTEI EALQADDTVA DLIAAAIETG YSRFPIVEGD LDETIGVVHV KQVFSVPRDD RDRTRLAAIA IPVATVPSTL DGDAVMTQIR ANGLQTALVV DEYGGTAGMV TVEDLIEEIV GDVRDEHDDA TPDVVAAGDG WQVSGLLRID EVATGTGFRA PEGEYETIGG LVLQELGHIP EVGDSVELTA FDPDGPLDDP IRWQAKVVQM DGRRIDLLEL VELGRRGDTD DDHIDNDDHH NKDGAAPEED R
|
| |