Gene Cthe_2770 details


Gene Information

Locus tag: Cthe_2770
Symbol:
ID: 4810087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3272258
End bp: 3273547
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 42%
IMG OID: 640108190
Product: transposase IS116/IS110/IS902
Protein accession: YP_001039162
Protein GI: 125975252
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
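
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3272258
end_bp = 3273547

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1290
print(protein_length)  # 429
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.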


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTTA GACCCATCGC CGGAATCGAT GTCGGCAAGT TCTTCAGTGA GATGGCAATT 
CTTTCTCCAT CCAATGAAGT AATTGCCCGC ATGAAGATCC GCCATGATTC CAGTACTGAC
GTTGAAAGAG CCGTTGAATT ACTGAAAAAA ACGGAAAAGG ACTTTGATTC TAGGCCTTTC
GTCGTCATGG AATCCACTGG GCACTATCAC AAAATCCTTT TCCATTCACT TTATAAAGCT
GGATTTGAGG TTTCTGTCAT AAACCCCATC CAAACTGATT CTATCAAAAA TATTGGAATA
AGGAAAGTGA AAAATGATAA AGTGGATGCC CGGAAAATTG CTCTGCTATA CAGATTTCAG
GAGCTTAAAA CTACCAATAT CCCCGACGAG GATATTGAAT GTCTGCGAAG CCTTTGCCGC
CAGTACTACA AGCTCTCTGA CGAACTTACT GCTTACAAAA ACAGGCTTAT GGGTATTGTT
GACCAACTCA TGCTAAACTT CAAGGATGTA TTCCCTAATA TCTTTTCAAA GGCTGCTCTT
GCAGTATTGG AGAAATATCC TGCACCTGCG CATATTCTTA AAGCGAACAG AAACAAGTTG
ATTGCACTGA TACAGAAGAA TTCCCGCAGA AGCCTTAAAT GGGCAACTGC AAAGTATGAG
CTTTTGAATT CCAAGGCCAA AGAATTTGCA CCTTTAAGCA TTAGTAACTC TTCAAATGTT
GCCATGCTTG GTGTGTATAT CTCTATGATT AAAACCTTGG AAGAAAACCT TGAGAAAGTC
CTCAAAGCCA TTCGTTCATT GATTATTGAA GATATGGCAA AGGACATGCC CATGCTGGCA
CTGACTCTCG AGCTTCTACA AAGCATTCCA GGTATAGGAC TTATCTCTGC TGTTACCATT
CTGGCTGAAA TTGGCGACTT TTCAGCCTTT TCAAAGCCAG GCAAGCTAGT TGCTTATTTC
GGTATTGACC CCTCTGTAAT GCAGTCCGGA GAGTTTACCG GCACACAAAA CAAGATGTCA
AAAAGGGGGT CAAGACTGCT TCGCAGAGTA CTTTTCACAA TTGCTCTTGC TAATATCCGC
ACCAAGCGGG ACAAAACAGC TTGCAACCCT GTACTGATGG AATATTACAA AAACAAATGC
CAGAGCAAGC CCAAGAAAGT AGCTTTGGGG GCTGTTATGC GTAAGCTTGT TAATTATATT
TTTGCTGTTC TTAGGGATAG AAAGCCTTAC GAATTACGTT CTCCCCAAGA GCACGCGCAA
ATGCTTGCAG CGAAGCACAC AGCAGCTTAG
 
Protein sequence
MNFRPIAGID VGKFFSEMAI LSPSNEVIAR MKIRHDSSTD VERAVELLKK TEKDFDSRPF 
VVMESTGHYH KILFHSLYKA GFEVSVINPI QTDSIKNIGI RKVKNDKVDA RKIALLYRFQ
ELKTTNIPDE DIECLRSLCR QYYKLSDELT AYKNRLMGIV DQLMLNFKDV FPNIFSKAAL
AVLEKYPAPA HILKANRNKL IALIQKNSRR SLKWATAKYE LLNSKAKEFA PLSISNSSNV
AMLGVYISMI KTLEENLEKV LKAIRSLIIE DMAKDMPMLA LTLELLQSIP GIGLISAVTI
LAEIGDFSAF SKPGKLVAYF GIDPSVMQSG EFTGTQNKMS KRGSRLLRRV LFTIALANIR
TKRDKTACNP VLMEYYKNKC QSKPKKVALG AVMRKLVNYI FAVLRDRKPY ELRSPQEHAQ
MLAAKHTAA
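
The sequences above are displayed in blocks of ten characters, so the spaces and line breaks need to be removed before any downstream use. The following is a minimal sketch of that cleanup, shown on the first and last lines of the gene sequence; the helper name `join_blocks` is hypothetical and not part of any tool associated with this record.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used only for display."""
    return "".join(text.split())

# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAATTTTA GACCCATCGC CGGAATCGAT GTCGGCAAGT TCTTCAGTGA GATGGCAATT"
last_line = "ATGCTTGCAG CGAAGCACAC AGCAGCTTAG"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 30 TAG
```

The gene sequence opens with the ATG start codon and closes with a TAG stop codon. The same helper can be applied to the full gene or protein sequence once it has been pasted into a single string.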