Gene Cthe_1115 details

Gene Information

Locus tag: Cthe_1115
Symbol:
ID: 4811413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1329264
End bp: 1330919
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 39%
IMG OID: 640106537
Product: Tn7-like transposition protein C
Protein accession: YP_001037540
Protein GI: 125973630
COG category:
COG ID:
TIGRFAM ID:
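
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the Gene Information record.
start_bp = 1329264
end_bp = 1330919
gene_length_bp = 1656
protein_length_aa = 551

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # coordinate span matches Gene Length
print(span_bp % 3 == 0)                           # span is a whole number of codons
print(expected_protein_aa == protein_length_aa)   # codons minus stop matches Protein Length
```

Under these assumptions all three checks agree with the values listed in the record.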


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAG TAATAATACC TAACGGTGCT AATGCTGTAG TTGCAGAATA CAAGGAACAG 
TTGATACCTG AATATAGCGG GAATCCATTT ATTGAAGCAC TACCACCGGT TTACTCTAAG
GAAGAGGTAG TGGAGAAACT GTCTCTGTAT CCCCGCTATA ATCCAGAGGA AAGACGATTG
GAAGACCACT ACCGTATTCA TATGGTGCAG CGGTTGTTCC AGTGCTTTCA GCCGTTGAGT
ATTCATCTTG ACCTAGAAAG TAGAATAAGC AGGGTTATAA GGCAGGGATA CCTGGCACGT
AACCCATTTA AACCTTCCTA TGCTGAAAGC CTACAAGACG GATATAAGGC TATACAAAGC
ATGAAATGGG AGTTAAGCAG CAACGCATCC TTCAGGACTA CTGCATCGGG ATTTACTATT
ATAGGTGTAA GTGGAATGGG TAAGACAACT GCTATCAACC GTGTATTATC TCTTTATCCC
CAGATAATTG TACATTCAGA ATATAATAAT ACCAATTTTA GTATGTACCA ACTGGTTTGG
CTTAAGTTGG ACTGCCCTTT TGATGGTTCT TTGAAAGGCT TGTGTATTGA GTTCTTCCAT
AAGGTAGATG ACCTGTTGGG TACAGACTAT CATAAGAAAT TCGGGGTAGG CAGGAATACG
GTAGACACTA TGCTCTCCGT TATGTCTCAG ATAGCCAGAA ATACAGCGTT AGGAGTATTA
GTAATTGACG AGATTCAACA TCTGAGCAGT GCAAAAAGCG GAGGGGATGA AAAGATGCTT
AACTTTTTTG TCACCCTTGT AAATACTATC GGAGTGCCTA CTGTACTTAT TGGTACAACA
AAAGCATTAT CAGTTTTGCA ATCTGAATTC CGTCAGGCAA GGCGTGGAAG CGGACAAGGG
GATATGATTT GGGAGAGGTT GAGCAAAGAT AAAAGTTGGG AACTGCTTAT CAATGCATTT
TGGGACTATC AATGGACCAA AAAGGAAGTA CCGCTAACAC CTGAATTAAG TGATGTTCTC
TATGAAGAGT CACAAGGCAT TATAGACATT GCGGTAAAAC TTTATGCGAT GTCACAAATA
CGGGCTATTC TTTCAGGGAA AGAGGTTATC ACAGCAAATC TGATTAGGCA GGTTGCAAAA
GATAATTTAA AATTGGTTCG TCCTATGCTG GAAGCATTAA AATCAGGAAA TATTAAAGAA
ATCGCAAAAT ATGAAGATAT TTGCACTGTA GACATTGATT TTATGGGATT TGTGGACAAA
AGCAAACAGT CAGTAGATTG GGACATGAGG ATGAAAATGC TTCAAAAGCA GCAAAAGAAA
AAAGAAGAGG AAGTCAATCT TTCAAAGAAG GAACAGGCAA TTCTTAAATT GCTGGACTTA
AATATTGATG CCAAAAAGGC TCAAAAAGCA GTTGAGAAGG TTCTTGATAA GGAAGAAGGG
CTTGAAGTTT CTGAGATTGT AATAAAGGCT GTACAGATGA TAGCAAACAA TGGTAAATTA
AAACAAAAGG AAAAGAGTAA AGCAAAGAAT ATGGATGAAA ATGATATAAG GTATATTGTG
GAAGAAGGCA GGAAAAATAA AAAATCAGCC TATGAATCAT TAAATGAAAA AGGGCTTATT
AAGCAGGTAG AAAAAGACTT TTTCAAGGCG GTGTAG
 
Protein sequence
MNKVIIPNGA NAVVAEYKEQ LIPEYSGNPF IEALPPVYSK EEVVEKLSLY PRYNPEERRL 
EDHYRIHMVQ RLFQCFQPLS IHLDLESRIS RVIRQGYLAR NPFKPSYAES LQDGYKAIQS
MKWELSSNAS FRTTASGFTI IGVSGMGKTT AINRVLSLYP QIIVHSEYNN TNFSMYQLVW
LKLDCPFDGS LKGLCIEFFH KVDDLLGTDY HKKFGVGRNT VDTMLSVMSQ IARNTALGVL
VIDEIQHLSS AKSGGDEKML NFFVTLVNTI GVPTVLIGTT KALSVLQSEF RQARRGSGQG
DMIWERLSKD KSWELLINAF WDYQWTKKEV PLTPELSDVL YEESQGIIDI AVKLYAMSQI
RAILSGKEVI TANLIRQVAK DNLKLVRPML EALKSGNIKE IAKYEDICTV DIDFMGFVDK
SKQSVDWDMR MKMLQKQQKK KEEEVNLSKK EQAILKLLDL NIDAKKAQKA VEKVLDKEEG
LEVSEIVIKA VQMIANNGKL KQKEKSKAKN MDENDIRYIV EEGRKNKKSA YESLNEKGLI
KQVEKDFFKA V
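
The relationship between the two sequence blocks can be illustrated by translating rows of the gene sequence and comparing them with the protein sequence. This is a minimal sketch, not the database's own method: the names `CODON_TABLE`, `clean`, and `translate` are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons). Only the first and last rows of the gene sequence are used here; each full row holds 60 bases, so every row begins in frame.

```python
from itertools import product

# Codon table in the usual TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq.split()).upper()

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAATAAAG TAATAATACC TAACGGTGCT AATGCTGTAG TTGCAGAATA CAAGGAACAG"
last_row = "AAGCAGGTAG AAAAAGACTT TTTCAAGGCG GTGTAG"

print(translate(first_row))  # MNKVIIPNGANAVVAEYKEQ
print(translate(last_row))   # KQVEKDFFKAV*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 11 residues followed by the stop codon TAG.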