Gene Cthe_1550 details


Gene Information

Locus tag: Cthe_1550
Symbol:
ID: 4810057
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1878759
End bp: 1880048
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 42%
IMG OID: 640106969
Product: transposase IS116/IS110/IS902
Protein accession: YP_001037970
Protein GI: 125974060
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
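
The start, end, and length values in this record agree with each other if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both are assumptions here, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1878759, 1880048)
print(length)                  # 1290
print(protein_length(length))  # 429
```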


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTTA GACCCATCGC CGGAATCGAT GTCGGCAAGT TCTTCAGTGA GATGGCAATT 
CTTTCTCCAT CCAATGAAGT AATTGCCCGC ATGAAGATCC GCCATGATTC CAGTACTGAC
GTTGAAAGAG CCGTTGAATT ACTGAAAAAA ACGGAAAAGG ACTTTGATTC TAGGCCTTTC
GTCGTCATGG AATCCACTGG GCACTATCAC AAAATCCTTT TCCATTCACT TTATAAAGCT
GGATTTGAGG TTTCTGTCAT AAACCCCATC CAAACTGATT CTATCAAAAA TATTGGAATA
AGGAAAGTGA AAAATGATAA AGTGGATGCC CGGAAAATTG CTCTGCTATA CAGATTTCAG
GAGCTTAAAA CTACCAATAT CCCCGACGAG GATATTGAAT GTCTGCGAAG CCTTTGCCGC
CAGTACTACA AGCTCTCTGA CGAACTTACT GCTTACAAAA ACAGGCTTAT GGGTATTGTT
GACCAACTCA TGCTAAACTT CAAGGATGTA TTCCCTAATA TCTTTTCAAA GGCTGCTCTT
GCAGTATTGG AGAAATATCC TGCACCTGCG CATATTCTTA AAGCGAACAG AAACAAGTTG
ATTGCACTGA TACAGAAGAA TTCCCGCAGA AGCCTTAAAT GGGCAACTGC AAAGTATGAG
CTTTTGAATT CCAAGGCCAA AGAATTTGCA CCTTTAAGCA TTAGTAACTC TTCAAATGTT
GCCATGCTTG GTGTGTATAT CTCTATGATT AAAACCTTGG AAGAAAACCT TGAGAAAGTC
CTCAAAGCCA TTCGTTCATT GATTATTGAA GATATGGCAA AGGACATGCC CATGCTGGCA
CTGACTCTCG AGCTTCTACA AAGCATTCCA GGTATAGGAC TTATCTCTGC TGTTACCATT
CTGGCTGAAA TTGGCGACTT TTCAGCCTTT TCAAAGCCAG GCAAGCTAGT TGCTTATTTC
GGTATTGACC CCTCTGTAAT GCAGTCCGGA GAGTTTACCG GCACACAAAA CAAGATGTCA
AAAAGGGGGT CAAGACTGCT TCGCAGAGTA CTTTTCACAA TTGCTCTTGC TAATATCCGC
ACCAAGCGGG ACAAAACAGC TTGCAACCCT GTACTGATGG AATATTACAA AAACAAATGC
CAGAGCAAGC CCAAGAAAGT AGCTTTGGGG GCTGTTATGC GTAAGCTTGT TAATTATATT
TTTGCTGTTC TTAGGGATAG AAAGCCTTAC GAATTACGTT CTCCCCAAGA GCACGCGCAA
ATGCTTGCAG CGAAGCACAC AGCAGCTTAG
 
Protein sequence
MNFRPIAGID VGKFFSEMAI LSPSNEVIAR MKIRHDSSTD VERAVELLKK TEKDFDSRPF 
VVMESTGHYH KILFHSLYKA GFEVSVINPI QTDSIKNIGI RKVKNDKVDA RKIALLYRFQ
ELKTTNIPDE DIECLRSLCR QYYKLSDELT AYKNRLMGIV DQLMLNFKDV FPNIFSKAAL
AVLEKYPAPA HILKANRNKL IALIQKNSRR SLKWATAKYE LLNSKAKEFA PLSISNSSNV
AMLGVYISMI KTLEENLEKV LKAIRSLIIE DMAKDMPMLA LTLELLQSIP GIGLISAVTI
LAEIGDFSAF SKPGKLVAYF GIDPSVMQSG EFTGTQNKMS KRGSRLLRRV LFTIALANIR
TKRDKTACNP VLMEYYKNKC QSKPKKVALG AVMRKLVNYI FAVLRDRKPY ELRSPQEHAQ
MLAAKHTAA
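
The protein sequence can be derived from the gene sequence with translation table 11, as listed in the record. The sketch below is a plain-Python illustration, not the method the database used. It assumes the gene sequence is already given in coding orientation and builds the codon table from the NCBI amino-acid string for table 11, whose codon assignments match the standard code. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, first position varying slowest.
BASES = "TCAG"
# Amino-acid string for translation table 11 (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Alternative start codons are not handled; this gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGAATTTTA GACCCATCGC CGGAATCGAT"))  # MNFRPIAGID
# Last 30 bases of the gene sequence, ending in the TAG stop codon
print(translate("ATGCTTGCAG CGAAGCACAC AGCAGCTTAG"))  # MLAAKHTAA
```

Passing the full gene sequence to `translate` in the same way should give the full protein sequence for comparison with the one listed in this record.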