Gene Cthe_2636 details


Gene Information

Locus tag: Cthe_2636
Symbol:
ID: 4808947
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3117237
End bp: 3118775
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 37%
IMG OID: 640108049
Product: integral membrane protein MviN
Protein accession: YP_001039028
Protein GI: 125975118
COG category: [R] General function prediction only
COG ID: [COG0728] Uncharacterized membrane protein, putative virulence factor
TIGRFAM ID: [TIGR01695] integral membrane protein MviN
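
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state explicitly: that the start and end coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3117237
end_bp = 3118775

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1539
print(protein_length)  # 512
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.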


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA AAATTGCCAT AGTATTAGCC ATTATAACGA TCATATCAAA ATTCTTCGGT 
TTCTTCAGGG AGATTATCCT GTCGTACTTT TATGGTGTAA GCAATGAAAG CGATGCCTAT
ATAATAGCCC TTACAATACC AACTGTCATT TTCGCATTTG TGGGCACCGG GCTTGCCACG
ACATTTATTC CCATATACAA CAGCATATTG GCACAAAAAG GTGAAAAGGC CGCAAATGCT
TTTACCAATA AAGTCATAAA CATAATATTT GTTATTTCCT CCGTAATAGT CCTTTTAATA
TTTGTCTTTA CAGAGCATAC AGTCAAATTG TTTGCATACG GTTTCGACAA AGAAACTATG
GAGTTGGCAG TCCAATTTAC CAGAATAATT TCCCTGGGGA TTTATTTTAT CGGGCTTGGC
TATGTTTTTA AAAGTCTGCT TCAAATAAAA GATAATTTTA TCGTCCCGGC AATAGTGGGA
TTCCCATATA ATTTCATAGT CATAATATCC ATCATTGCAA GTACAAAGTG GAATATTATG
ATTTTGCCTC TGGGCACTTT TATTGCCACA TCTCTGGAAA CCATTGTTTT GTTTCCAGGC
ATAATAAAGT CGGGATACAA ATACCTGCTC GACTTTAAAA TTGACAACCA CATAAAAAAG
ATGTTTTTTC TGTCAATACC GGTTATACTG GGAACATCTG TAAACCAAAT CAACAAACTT
GTTGACAGAA CTTTGGCCTC CCAGATTTCC GTGGGAGGAA TTTCTGCATT AAATTACGCG
TCAAGACTGA ACAATTTTGT CCAGGGAGTC TTTGTGGTTT CAGTGATTGC GGTAATGTAT
CCCGCAATAT CCAAACTGGC AGCTGAAAAT AATATGAAAG AACTTAAAAA AGTATTGTCG
GAATCAATTA TCGGAGTAAC ATTGCTATTA GTGCCGCTGT CTGTAGGTGC CATGATTTTT
TCAAAAGAAA TAGTTGCATT GTTGTTTGGC AGGGGAGCAT TTGACAAAAC CGCGGTAGAT
ATGACTTCCG TATCCCTGTT CTATTATTCC ATAGGTATGC TGGCATTTGG AATCAGGGAT
GTTCTTTCAA GAGTGTTTTA CTCTGTCAAA GACACTAAAA CCCCAACAAT TAACGCAGGT
ATCGGCATGG CGCTCAATAT TGTTTTGAAT ATAATTTTGT CCCGATACAT GGGAATCGGG
GGTCTGGCAC TGGCAACCAG CATAGTAGGC ATATTCATCA CAATATTGAT GTTTGTAACG
CTGAGAAAAA AAATAGGCCC CTTGGGAATG AAAGCAATGA GTTTTAAATT CTTCAAGATT
TTGGTATCTT CATTGCTTAT GGGAGTAATA GCCCACATAT CTTACAGATA TCTTGAAAAT
TTCGCAGGTT CCAATATTTC AATCATAATA TCAATCACAG GCGGTGCATT GATATACTTT
GTGATTATCT ATTTTATGAA AATCGAGGAT GTGGAAGTTT TGGTAAAACA GTTTAAGCGC
AAATTATTCG GCAGAAAAAA ACAGCTCAAT AACGGTTAA
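
A GC content figure such as the one in the Gene Information fields can be recomputed from a block-formatted sequence like the one above. The following is a minimal sketch, not the method used to produce this record; `gc_content` is a hypothetical helper, and it assumes the sequence contains only A, C, G and T plus the whitespace used for the 10-base blocks.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used for block formatting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


print(gc_content("ATGC ATGC"))  # 50.0
```

Passing the full gene sequence as a single string gives a value that can be compared with the reported GC content.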
 
Protein sequence
MKKKIAIVLA IITIISKFFG FFREIILSYF YGVSNESDAY IIALTIPTVI FAFVGTGLAT 
TFIPIYNSIL AQKGEKAANA FTNKVINIIF VISSVIVLLI FVFTEHTVKL FAYGFDKETM
ELAVQFTRII SLGIYFIGLG YVFKSLLQIK DNFIVPAIVG FPYNFIVIIS IIASTKWNIM
ILPLGTFIAT SLETIVLFPG IIKSGYKYLL DFKIDNHIKK MFFLSIPVIL GTSVNQINKL
VDRTLASQIS VGGISALNYA SRLNNFVQGV FVVSVIAVMY PAISKLAAEN NMKELKKVLS
ESIIGVTLLL VPLSVGAMIF SKEIVALLFG RGAFDKTAVD MTSVSLFYYS IGMLAFGIRD
VLSRVFYSVK DTKTPTINAG IGMALNIVLN IILSRYMGIG GLALATSIVG IFITILMFVT
LRKKIGPLGM KAMSFKFFKI LVSSLLMGVI AHISYRYLEN FAGSNISIII SITGGALIYF
VIIYFMKIED VEVLVKQFKR KLFGRKKQLN NG
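
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one possible way to do this and is not taken from the record itself; `translate` and the constant names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code, and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for block formatting.
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAAGAAAA AAATTGCCAT AGTATTAGCC ATTATAACGA TCATATCAAA ATTCTTCGGT"
print(translate(first_line))  # MKKKIAIVLAIITIISKFFG
```

The printed residues match the first two 10-residue blocks of the protein sequence.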