Gene Information |
Locus tag | Cthe_1051 |
Symbol | |
ID | 4811349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1255085 |
End bp | 1256662 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640106473 |
Product | integral membrane protein MviN |
Protein accession | YP_001037476 |
Protein GI | 125973566 |
COG category | [R] General function prediction only |
COG ID | [COG0728] Uncharacterized membrane protein, putative virulence factor |
TIGRFAM ID | [TIGR01695] integral membrane protein MviN |
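The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Cthe_1051.
length = gene_length_bp(1255085, 1256662)
print(length, protein_length_aa(length))  # prints: 1578 525
```

Both results match the Gene Length (1578 bp) and Protein Length (525 aa) fields listed in the record.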
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000233092 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGGAAGA ATAAAAAACT TACCGGCGCT GCTCTTATTG TGATGTCATC CATCATAGTC AGCAGAATCA CCGGATTTGT AAGAGAAATG CTTGTGCCGA ACCTTATAGG AGTTAATGAA GAAGGGGATG CTTATACCGT TGCTTTCAAA ATTACAGGTC TGATGTATGA CATGCTGGTA GGAGGTGCGG TGTCTGCGGC CCTTATTCCC GTTCTGTCGG GTTATATTGC CCGGGATGAT GAAGAAACGG GATGGAAAGT GGTAGGTACT TTTATAAACA CTGTTATTGT TGCCATGGTT GCAGTGTGCT TTTTAGGAAT AATTTTTGCA CCCCAGGTGG TTTCGCTGAT TGGCGCAGGA TTTGAAACAG ATGCCCAAAA ACAGCTTACT GTTGATCTTA TAAGGATACT TTTTCCTTCT GTTGCTTTTC TTATGATGGC GGGACTTTGC AATGGGGTTT TGAACTCCTA CAACCGTTTT GCGGCAGCGG CATACGGACC GTCTCTTTAC AATATTGGAA GCGCTCTGAG CATAATTGTA TTCAGCGTCA GCAGATGGGG AGTCAGAGGC GTTGCTTTTG GAGTAATGCT AAGCTCGCTG GTTTATTTTT TGTTCCAGCT GTCCTTTGCC GTTAAAAATC TTAAGCTTTA CAGATTTAAA TTCTATTTGA AGCATGAGGG GTCAAAAAAG CTCTTTAAGC TTGCCATACC TTCTTTGATA TCTTCGGCGA TTGTTCAGAT AAATGCAGTT ATCAGCAGCA CCTTTGCAAC ACTCTTTGGT GTTGGCGGGG CAACGGCACT GAATATCGGG GACAGAACAT GGCAGCTTCC ATACGGTGTT TTTGCCCAGG GCATGGGTAT TGCCATGCTG CCCTCACTGT CTTCAAACAT TGCAAAGGGA GAGGTGGATG AGTATAAAAA CACTCTTATT AAAGGTATAA AGACCGTGCT GTTTTTTACC ATACCGTCCG GTGTTGGCTT TATTGTATTA AAAGAGCCGG TAATCAGGAC TATTTTCAAA TTTACAAGCC GGTTTGACGA AGGGGCTGTA AGTGTTGCGG CAAATGTTCT GATGTTTTTT TCCATTGCGC TTTTGAGCCA GTCCATTGTC ACTGTGACAA ACAGGGCTTT TTATGCGATT AATGACACAC TCACTCCTCT TTTGGTTGGA GGAAGCACTA TAATAATTAA CATACTTTTA AGTATTGTGT TTTATAAAAT GACAAATCTT GGGGTTGCAG GAATGGCTCT TGCTTATTCC CTTGCCAGTG CGGTAAATGC TTTTCTGCTT TTGAGTATTT TAAACAGAAA AATGAAAGGT ATTTATATTG ACAGGCTTCT TAGATTTCTT TTTAAGGTTG TGCCTTCGGC AATGATAATG GGAATGGTTC TTTTTATAAC GAATGCTTTT TTTGTACCGG ATACTTCAGC CAAGGTTGTG CAGCTTTTAA ACCTCATTTT TCAGATAGCC CTGGGTGTGT TGGTGTATTT TGCTGCCGTG TTGGTGCTTA AAGTGGAGGA GGCATTGTAT TTTAAAGATA TGGCTTTATC AAGGCTTAAA AAAATAGTAA AAAAATGA
|
Protein sequence | MGKNKKLTGA ALIVMSSIIV SRITGFVREM LVPNLIGVNE EGDAYTVAFK ITGLMYDMLV GGAVSAALIP VLSGYIARDD EETGWKVVGT FINTVIVAMV AVCFLGIIFA PQVVSLIGAG FETDAQKQLT VDLIRILFPS VAFLMMAGLC NGVLNSYNRF AAAAYGPSLY NIGSALSIIV FSVSRWGVRG VAFGVMLSSL VYFLFQLSFA VKNLKLYRFK FYLKHEGSKK LFKLAIPSLI SSAIVQINAV ISSTFATLFG VGGATALNIG DRTWQLPYGV FAQGMGIAML PSLSSNIAKG EVDEYKNTLI KGIKTVLFFT IPSGVGFIVL KEPVIRTIFK FTSRFDEGAV SVAANVLMFF SIALLSQSIV TVTNRAFYAI NDTLTPLLVG GSTIIINILL SIVFYKMTNL GVAGMALAYS LASAVNAFLL LSILNRKMKG IYIDRLLRFL FKVVPSAMIM GMVLFITNAF FVPDTSAKVV QLLNLIFQIA LGVLVYFAAV LVLKVEEALY FKDMALSRLK KIVKK
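The relationship between the gene sequence and the protein sequence can be sketched in a few lines. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is listed in coding orientation (it begins with TTG and ends with TGA even though the gene lies on the minus strand), and it hard-codes the codon assignments and alternative start codons of NCBI translation table 11. Under that table, an alternative start codon such as TTG is read as methionine at the initiation position. The names `translate_cds` and `gc_percent` are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11 (assumption: NCBI definition).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding-strand CDS; blanks between 10-mers are ignored."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a sequence."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGGGAAGA ATAAAAAACT TACCGGCGCT"))  # prints: MGKNKKLTGA
```

The output agrees with the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.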