Gene Information |
Locus tag | Mmcs_0442 |
Symbol | |
ID | 4109288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 490259 |
End bp | 493399 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638029568 |
Product | protein of unknown function DUF224, cysteine-rich region |
Protein accession | YP_637619 |
Protein GI | 108797422 |
COG category | [C] Energy production and conversion |
COG ID | [COG0247] Fe-S oxidoreductase |
TIGRFAM ID | |
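The coordinate and length fields above are mutually consistent, and the arithmetic can be checked mechanically. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function name `check_cds_lengths` is hypothetical and not part of any database API.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True when the record's lengths agree with its coordinates.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the CDS contains exactly one stop codon
    that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    if span != gene_length_bp:
        return False
    if gene_length_bp % 3 != 0:           # an unspliced CDS is whole codons
        return False
    codons = gene_length_bp // 3
    return codons - 1 == protein_length_aa  # drop the stop codon


# Values copied from the record for Mmcs_0442.
ok = check_cds_lengths(490259, 493399, 3141, 1046)
print(ok)
```

For this record the span is 493399 - 490259 + 1 = 3141 bp, which is 1047 codons, or 1046 residues once the stop codon is excluded.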
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGGACACGC AGATTCTGAT CCGACTCATC GTCGGGTTGC TGATGACCGG CATCGTGATC GCACTCGCCG CAAAACGCGT GCTGTGGTTG ACGAAGCTGA TCCGGTCCGG CGCGCCCACC AGCGAGGCGA ACAACCGCAA GGACAACCTC GGCAAACGCA TCACCACGCA GTTCGAAGAG GTCTTCGGCC AGACCCGCCT GCTGCGCTGG TCGATCCCCG GTATCGCGCA CTTCTTCACG ATGTGGGGCT TCTTCATCCT CGCGTCGGTC TACCTGGAGG CCTACGGCCT GCTGTTCGAC CACGACTTCC ACATCCCGAT CATCGGCACA TGGGACGCAC TCGGTTTCCT GCAGGACTTC TTCGCCGTCG CGGTGCTGCT CGGCATCATC ACCTTCGCGA TCATCCGCAT CGTCCGCGAG CCCAAGAAAC ACGGCCGCGA CTCCCGGTTC TACGGTTCGC ACACCGGCGG CGCCTGGCTG ATCCTGTTCA TGATCTTCAA CGTCATCTGG ACCTACGCGC TGGTGCGTGG CGCCGCCGCG GACGTCGGCA ACCTGCCCTA CGGCAACGGC GCGTTCTTCT CCCAGTTCAT GGGCTGGGTC CTGAGCCCGC TCGGCCACGA CGTCAACGAG TGGGTCGAGA CCATCGCCCT GCTGCTGCAC ATCGCGGTCA TGCTGGTCTT CCTGCTGATC GTGCTGCACT CCAAGCACCT TCACATCGGT CTGGCGCCCA TCAACGTCAC GTTCAAGCGG CTGCCCAACG GCCTCGGCCC GTTGCTGCCG ATGGAGTACA ACGGCGAGCC GATCGACTTC GAGGATCCCG CCGAGGACGC CGTGCTGGGC CGCGGCAAGA TCGAGGACTT CACCTGGAAG GGCTACCTCG ACTTCACGAC GTGCACCGAA TGCGGCCGCT GCCAGTCGCA ATGCCCGGCG TGGAACACGG GTAAGCCGCT GTCGCCCAAG CTCGTGATCA TGAACCTGCG CGACCACATG TTCGCCAAGG CGCCCTACAT CCTCGGCGAC AAGGAGTCGC CGCTGGAGAA CACCCCCGAG GGCGGTCTCG GCGAGGAACT GCGCGGTGAG AAGGAATCCG AGAAGCACTC CCACGACCAC GTCCCGGAGT CCGGCTTCGA ACGGATCATG GGCTCCGGAC CCGAACAGGC CACCCGCCCG CTGGTCGGCA CCCTCGAACA GGGCGGCGTG ATCGACCCCG ACGTGCTGTG GTCCTGCACC ACCTGCGGCG CCTGCGTCGA GCAGTGCCCG GTCGACATCG AGCACATCGA CCACATCGTG GACATGCGCC GCTACCAGGT GATGATGGAG TCCGAGTTCC CCGGTGAACT CGGTGTGCTG TTCAAGAACC TGGAGACCAA GGGCAACCCC TGGGGCCAGA ACGCCAAGGA CCGCACCAAT TGGATCGACG AGGTCGACTT CGACGTGCCG GTGTTCGGCA AGGACGTCGA ATCCTTCGAG GGCTACGAGT ACCTGTTCTG GGTCGGCTGC GCCGGCGCCT ACGAGGACCG CGCGAAGAAG ACCACCAAGG CGGTCGCCGA ACTGCTCGCG ATCGCCGGCG TCAAGTACCT GGTGCTCGGC GAGGGTGAGA CCTGTAACGG CGACTCGGCC CGACGCTCCG GCAACGAGTT CCTGTTCCAG CAGCTCGCCG CACAGAACGT CGAGACCCTC AACGACCTGT TCGAAGGTGT GGAGCGGGTC GACCGCAAGG TCGTCGTCAC CTGCCCGCAC TGCTTCAACA CCCTCGGTCG CGAGTACCCG CAGGTCGGCG GCAACTACAC CGTCCTGCAC 
CACACCCAGC TGCTCAACCG GCTGGTCCGC GACAAGAAGC TGGTTCCGGT CACCCCCGCC GACGGTGGGG CCGACATCAC CTACCACGAT CCCTGCTACC TGGGCCGGCA CAACAAGGTC TACGAGGCGC CCCGTGAGCT GATCGGCGCC TCCGGCGCGA AGCTGACCGA GATGCCGCGT CACGCCGACC GCGGCCTGTG CTGCGGTGCC GGCGGTGCGC GGATGTGGAT GGAAGAGCAC ATCGGCAAGC GCGTCAACGT GGAGCGCACC GAGGAGGCGA TGGACACCGC CTCGACGATC GCCACCGGTT GCCCGTTCTG CCGCGTGATG ATCACCGACG GTGTCGACGA CGTCGCCGCC ACCCGCAACG TCGAGAAGGC CGAGGTCCTC GACGTGGCCC AGCTGCTGCT CGGGTCGCTG GACAAGAGCG GCGTCACGCT GCCGGAGAAG GGCACCGCGG CCAAGGAGGC CGAGGAGCGC GCCGCCGTTC GTGCTGAGGA GACGGCAGCG GCCGCACCGG CCCCAGAGAA GGCCCCCGAG AAGGAGCCGG AAGCGCCGGC CGAAGCACCC GCGGCCAAGG CGTCGACGGC CACCGAATCG AAGCCGGCGG CCGCAGCTCC GGCGAAGGGG CTCGGTATCG CCGGGGGCGC CAAGCGTCCC GGTGCGAAGA AGACCGCCAC CGAGAAGACC GCCGCGGCGC CCGCGGAAGC CAAGGCCGAG GCACCCGCGG CCGCACCGGC CAAGGGGCTG GGTCTCGCCG CAGGCGCCAA GCGTCCCGGC GCGAAGAAGA CCGCCACCGA GAAGACCACA GCTGCACCCG AGGCCAAGGC CGAGACTGCG GGCACCACCG AGGCTCCCGC GGAAGCCAAA CCCGAACCCG AGGTCAAGGG GCTGGGTCTG GCCGCAGGCG CCAGGCGTCC CGGCGCGAAG AAGGCGCCCG CCAAGGCGTC GCCGAACGAA GGCGCCGCGA CGGTCGTCCA GCCGCCGAAC GCGGACCCGG ACCAGGCCGA GGCCGGCACC GAGGCCCCGG ACACCGCCGA TTCGGATCGG GGTCTGGAGA CCAAGCCGGA ACCCGAGGTG AAGGGGCTCG GCATCGCCGC GGGCGCCCGT CGCCCGGGTG CGAAGAAGAA ACCCGCCGCA CCGGCTGCTG CGCCCGCGAA GCCGGCCGAA CCCGAGCCGC AAGCCCAGCC TGAAGCCCAA GCCGAACCAG CACCCGAACC CGAGCCGGAG GCACCGTCGC AGCCCGCATC GGGCAACGGC GACGCCCGCG TCGTGGGTGA CGAGCCGCCG GTCAAGGGCC TGGGCATCGC GAAGGGTGCC CGTCGGCCGG GGAAGCGCTG A
Protein sequence | MDTQILIRLI VGLLMTGIVI ALAAKRVLWL TKLIRSGAPT SEANNRKDNL GKRITTQFEE VFGQTRLLRW SIPGIAHFFT MWGFFILASV YLEAYGLLFD HDFHIPIIGT WDALGFLQDF FAVAVLLGII TFAIIRIVRE PKKHGRDSRF YGSHTGGAWL ILFMIFNVIW TYALVRGAAA DVGNLPYGNG AFFSQFMGWV LSPLGHDVNE WVETIALLLH IAVMLVFLLI VLHSKHLHIG LAPINVTFKR LPNGLGPLLP MEYNGEPIDF EDPAEDAVLG RGKIEDFTWK GYLDFTTCTE CGRCQSQCPA WNTGKPLSPK LVIMNLRDHM FAKAPYILGD KESPLENTPE GGLGEELRGE KESEKHSHDH VPESGFERIM GSGPEQATRP LVGTLEQGGV IDPDVLWSCT TCGACVEQCP VDIEHIDHIV DMRRYQVMME SEFPGELGVL FKNLETKGNP WGQNAKDRTN WIDEVDFDVP VFGKDVESFE GYEYLFWVGC AGAYEDRAKK TTKAVAELLA IAGVKYLVLG EGETCNGDSA RRSGNEFLFQ QLAAQNVETL NDLFEGVERV DRKVVVTCPH CFNTLGREYP QVGGNYTVLH HTQLLNRLVR DKKLVPVTPA DGGADITYHD PCYLGRHNKV YEAPRELIGA SGAKLTEMPR HADRGLCCGA GGARMWMEEH IGKRVNVERT EEAMDTASTI ATGCPFCRVM ITDGVDDVAA TRNVEKAEVL DVAQLLLGSL DKSGVTLPEK GTAAKEAEER AAVRAEETAA AAPAPEKAPE KEPEAPAEAP AAKASTATES KPAAAAPAKG LGIAGGAKRP GAKKTATEKT AAAPAEAKAE APAAAPAKGL GLAAGAKRPG AKKTATEKTT AAPEAKAETA GTTEAPAEAK PEPEVKGLGL AAGARRPGAK KAPAKASPNE GAATVVQPPN ADPDQAEAGT EAPDTADSDR GLETKPEPEV KGLGIAAGAR RPGAKKKPAA PAAAPAKPAE PEPQAQPEAQ AEPAPEPEPE APSQPASGNG DARVVGDEPP VKGLGIAKGA RRPGKR
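The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11: an alternative initiator codon at the first position is read as methionine. The sketch below illustrates this, assuming the sequence is already given in coding orientation (as it appears to be here, despite the minus strand) and that table 11 shares the standard codon assignments. The names `translate_cds` and `START_CODONS` are hypothetical, and the start-codon set is an assumption to verify against the NCBI genetic code listing.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG order; table 11 is assumed to
# use the same assignments and differ only in its permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed initiator codons for table 11 (check against the NCBI listing).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()    # drop the 10-base spacing
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":                # stop codon ends translation
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"                 # initiator is read as Met
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGACACGC AGATTCTGAT CCGACTCATC"))  # MDTQILIRLI
```

The printed prefix matches the first ten residues of the protein sequence in the record; the same function applied to the last four codons (GGG AAG CGC TGA) gives GKR, matching the protein's C-terminus.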