Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmcs_3235 |
Symbol | |
ID | 4112067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. MCS |
Kingdom | Bacteria |
Replicon accession | NC_008146 |
Strand | - |
Start bp | 3429619 |
End bp | 3431169 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638032367 |
Product | hypothetical protein |
Protein accession | YP_640398 |
Protein GI | 108800201 |
COG category | [S] Function unknown |
COG ID | [COG3333] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAATT TCGACTGGCT GATGCAGGGC TTCGCGGAGG CCGCGACACC GATGAACCTG CTCTACGCGA TCATCGGCGT GCTGCTGGGC ACCGCGGTCG GTGTGCTGCC GGGGATCGGC CCCGCGATGA CGGTGGCGCT GCTGCTGCCG GTCACCTACA ACGTCAGCCC GAGCGCGGCG TTCATCATGT TCGCCGGCAT CTTCTACGGC GGCATGTACG GCGGATCGAC CACCTCGATC CTGCTGAACA CCCCCGGTGA ATCGTCGTCG GTGATCACCG CGATCGAGGG CAACAAGATG GCCAAGGCCG GCCGGGCCGC CCAGGCGCTG GCCACCGCCG CGATCGGCTC GTTCGTCGCC GGTGCGATCG GCACTGCGCT GCTCGCGGCC TTCGCACCCC CGATCAGCAG GTTCGCGGTC ACGCTCGGCG CGCCGTCGTA CCTGGCGATC ATGGTGTTCG CGCTGGTCGC GGTCACCGCG GTGCTCGGCG CCTCGAAGCT GCGCGGGGCG ATCTCGCTGT TTCTCGGCCT GGCCATCGGG GTGGTGGGCA TCGACTTCCT CACCGGCCAA CCGCGGGCCA CCTTCGGGCT ACCGCAGCTG TCCGACGGTA TCGACATCGT GGTGATCGCC GTCGCCGTGT TCGCCGTCGG CGAGGCGTTG TGGGTGGCCG CCCACCTGCG GCGGCGCCCC GCGGAGGTGA TCCCGGTGGG CCGGCCGTGG ATGGGTCGCG ACGACTTTCG CCGGTCATGG AAGCCCTGGC TGCGCGGCAC CGCCTACGGC TTCCCGTTCG GTGCGCTGCC CGCCGGCGGC GCCGAACTGC CGACGTTCCT GTCCTACATC ACCGAGAAGA AGCTCGCGAA ACGCACGGGG CACGATGTGG AGTTCGGCAA GGGCGCGATC GAGGGCGTGG CCGGACCGGA GGCGGCCAAC AACGCGTCGG CGGCGGGCAC GCTGGTGCCG ATGCTGTCGC TCGGCCTGCC CACCAACGCC ACCGCGGCGG TCATCCTGAC CGCCTTCGTG TCCTACGGAA TCCAGCCCGG TCCAACGCTT TTCGAGAAGG AGCCGTTGCT GATCTGGACG CTGATCGCCA GCCTGTTCAT CGGCAACCTG CTGCTGTTGG TGCTCAACCT GCCGCTGGCC CCGCTGTGGG CGAAACTGCT GCGCACACCG CGGCCGTACC TGTACGCCGG CATCCTGTTC TTCGCCACGC TGGGTGCTCT GGCCGTCAAC ATCCAACCGC TGGACCTGGC GCTGCTGTTG GTGTTCGGAC TGCTCGGGTT GATGATGCGC CGCTTCGGCC TCCCGGTGCT GCCGTTGATC ATCGGGGTCA TCCTCGGGCC GCGGATCGAA CGCCAACTGC GGCAGAGCCT TCAACTCGGC GGCGGCGACT GGACGAGCCT GTTCACCGAA CCGGTCGCGA TCGTCGTCTA CGTGTTGATG GCGCTGTTAC TGCTGGCCCC CTTGGTGCTC AAGCTCTTTC ACCGTAGTGA GGACACTCTG CTCATCGTGG AGGACGATGT GGACCAACAG GAGAAGGCGG CACGGACATG A
|
Protein sequence | MNNFDWLMQG FAEAATPMNL LYAIIGVLLG TAVGVLPGIG PAMTVALLLP VTYNVSPSAA FIMFAGIFYG GMYGGSTTSI LLNTPGESSS VITAIEGNKM AKAGRAAQAL ATAAIGSFVA GAIGTALLAA FAPPISRFAV TLGAPSYLAI MVFALVAVTA VLGASKLRGA ISLFLGLAIG VVGIDFLTGQ PRATFGLPQL SDGIDIVVIA VAVFAVGEAL WVAAHLRRRP AEVIPVGRPW MGRDDFRRSW KPWLRGTAYG FPFGALPAGG AELPTFLSYI TEKKLAKRTG HDVEFGKGAI EGVAGPEAAN NASAAGTLVP MLSLGLPTNA TAAVILTAFV SYGIQPGPTL FEKEPLLIWT LIASLFIGNL LLLVLNLPLA PLWAKLLRTP RPYLYAGILF FATLGALAVN IQPLDLALLL VFGLLGLMMR RFGLPVLPLI IGVILGPRIE RQLRQSLQLG GGDWTSLFTE PVAIVVYVLM ALLLLAPLVL KLFHRSEDTL LIVEDDVDQQ EKAART
|
| |