Gene Cthe_1051 details


Gene Information

Locus tag: Cthe_1051
Symbol:
ID: 4811349
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1255085
End bp: 1256662
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 41%
IMG OID: 640106473
Product: integral membrane protein MviN
Protein accession: YP_001037476
Protein GI: 125973566
COG category: [R] General function prediction only
COG ID: [COG0728] Uncharacterized membrane protein, putative virulence factor
TIGRFAM ID: [TIGR01695] integral membrane protein MviN
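
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1255085
END_BP = 1256662
REPORTED_GENE_LENGTH = 1578      # bp
REPORTED_PROTEIN_LENGTH = 525    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1
```

Under these assumptions, `gene_length(START_BP, END_BP)` gives 1578 and `protein_length(1578)` gives 525, which agrees with the reported Gene Length and Protein Length.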


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000233092
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGGAAGA ATAAAAAACT TACCGGCGCT GCTCTTATTG TGATGTCATC CATCATAGTC 
AGCAGAATCA CCGGATTTGT AAGAGAAATG CTTGTGCCGA ACCTTATAGG AGTTAATGAA
GAAGGGGATG CTTATACCGT TGCTTTCAAA ATTACAGGTC TGATGTATGA CATGCTGGTA
GGAGGTGCGG TGTCTGCGGC CCTTATTCCC GTTCTGTCGG GTTATATTGC CCGGGATGAT
GAAGAAACGG GATGGAAAGT GGTAGGTACT TTTATAAACA CTGTTATTGT TGCCATGGTT
GCAGTGTGCT TTTTAGGAAT AATTTTTGCA CCCCAGGTGG TTTCGCTGAT TGGCGCAGGA
TTTGAAACAG ATGCCCAAAA ACAGCTTACT GTTGATCTTA TAAGGATACT TTTTCCTTCT
GTTGCTTTTC TTATGATGGC GGGACTTTGC AATGGGGTTT TGAACTCCTA CAACCGTTTT
GCGGCAGCGG CATACGGACC GTCTCTTTAC AATATTGGAA GCGCTCTGAG CATAATTGTA
TTCAGCGTCA GCAGATGGGG AGTCAGAGGC GTTGCTTTTG GAGTAATGCT AAGCTCGCTG
GTTTATTTTT TGTTCCAGCT GTCCTTTGCC GTTAAAAATC TTAAGCTTTA CAGATTTAAA
TTCTATTTGA AGCATGAGGG GTCAAAAAAG CTCTTTAAGC TTGCCATACC TTCTTTGATA
TCTTCGGCGA TTGTTCAGAT AAATGCAGTT ATCAGCAGCA CCTTTGCAAC ACTCTTTGGT
GTTGGCGGGG CAACGGCACT GAATATCGGG GACAGAACAT GGCAGCTTCC ATACGGTGTT
TTTGCCCAGG GCATGGGTAT TGCCATGCTG CCCTCACTGT CTTCAAACAT TGCAAAGGGA
GAGGTGGATG AGTATAAAAA CACTCTTATT AAAGGTATAA AGACCGTGCT GTTTTTTACC
ATACCGTCCG GTGTTGGCTT TATTGTATTA AAAGAGCCGG TAATCAGGAC TATTTTCAAA
TTTACAAGCC GGTTTGACGA AGGGGCTGTA AGTGTTGCGG CAAATGTTCT GATGTTTTTT
TCCATTGCGC TTTTGAGCCA GTCCATTGTC ACTGTGACAA ACAGGGCTTT TTATGCGATT
AATGACACAC TCACTCCTCT TTTGGTTGGA GGAAGCACTA TAATAATTAA CATACTTTTA
AGTATTGTGT TTTATAAAAT GACAAATCTT GGGGTTGCAG GAATGGCTCT TGCTTATTCC
CTTGCCAGTG CGGTAAATGC TTTTCTGCTT TTGAGTATTT TAAACAGAAA AATGAAAGGT
ATTTATATTG ACAGGCTTCT TAGATTTCTT TTTAAGGTTG TGCCTTCGGC AATGATAATG
GGAATGGTTC TTTTTATAAC GAATGCTTTT TTTGTACCGG ATACTTCAGC CAAGGTTGTG
CAGCTTTTAA ACCTCATTTT TCAGATAGCC CTGGGTGTGT TGGTGTATTT TGCTGCCGTG
TTGGTGCTTA AAGTGGAGGA GGCATTGTAT TTTAAAGATA TGGCTTTATC AAGGCTTAAA
AAAATAGTAA AAAAATGA
 
Protein sequence
MGKNKKLTGA ALIVMSSIIV SRITGFVREM LVPNLIGVNE EGDAYTVAFK ITGLMYDMLV 
GGAVSAALIP VLSGYIARDD EETGWKVVGT FINTVIVAMV AVCFLGIIFA PQVVSLIGAG
FETDAQKQLT VDLIRILFPS VAFLMMAGLC NGVLNSYNRF AAAAYGPSLY NIGSALSIIV
FSVSRWGVRG VAFGVMLSSL VYFLFQLSFA VKNLKLYRFK FYLKHEGSKK LFKLAIPSLI
SSAIVQINAV ISSTFATLFG VGGATALNIG DRTWQLPYGV FAQGMGIAML PSLSSNIAKG
EVDEYKNTLI KGIKTVLFFT IPSGVGFIVL KEPVIRTIFK FTSRFDEGAV SVAANVLMFF
SIALLSQSIV TVTNRAFYAI NDTLTPLLVG GSTIIINILL SIVFYKMTNL GVAGMALAYS
LASAVNAFLL LSILNRKMKG IYIDRLLRFL FKVVPSAMIM GMVLFITNAF FVPDTSAKVV
QLLNLIFQIA LGVLVYFAAV LVLKVEEALY FKDMALSRLK KIVKK
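
The gene sequence begins with TTG, yet the protein begins with M. Under translation table 11, TTG is one of the permitted start codons, and a start codon is read as methionine when it initiates translation. The sketch below illustrates this relationship and shows one way to compute GC content from the space-grouped sequence blocks. It is an illustration under stated assumptions, not the method the database used: the helper names (`clean`, `translate_cds`, `gc_fraction`) are hypothetical, and the input is assumed to contain only the bases A, C, G, and T.

```python
# Codon-to-amino-acid mapping for translation table 11. The amino acid
# assignments match the standard code; the table differs in which codons
# may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiating start codon is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)
```

For example, `translate_cds("TTGGGGAAGA ATAAAAAACT TACCGGCGCT GCTCTTATTG TGATGTCATC CATCATAGTC")` returns `MGKNKKLTGAALIVMSSIIV`, the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.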