Gene Cthe_1114 details


Gene Information

Locus tag: Cthe_1114
Symbol:
ID: 4811412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1327372
End bp: 1329258
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 36%
IMG OID: 640106536
Product: Tn7-like transposition protein D
Protein accession: YP_001037539
Protein GI: 125973629
COG category:
COG ID:
TIGRFAM ID:
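
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1327372
end_bp = 1329258
gene_length_bp = 1887
protein_length_aa = 628

# Assumption: coordinates are inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length counts the stop codon, so the number of
# codons is one more than the number of amino acids.
codons = gene_length_bp // 3

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("codons minus stop matches protein length:", codons - 1 == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values.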


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGACGT TTTTTCCTGT GCCATATGAG GATGAAGTAT TATACAGCGT CCTTGCAAGA 
TACCATGTGA GGAGTGGAAA TATAAGTTAT AAGGCGACAA TGAGAGACCT TTTCGGGTCA
ACTTCCGTAA CAGCGGTGAT GGACTTGCCT TCAAATATAC ATAACCTCGT CAATAATATG
CCCCTTAATT CCAGGTATAC CGAAGAATAT CTTATTAAAA ATCATACACT TTTTCCGTTC
TATTCTGCTT TCTTGCCTCC TGAGCGTGCA GAACAAGTTT TTCAATCAAT GAAAGGGGAA
AATGGAGGAA GCATATATAC CCGGACTGGA ATTATGGCCA GTTCTATAGT ATTAAACCAG
TATTTTAAGT TTTGTCCTAC ATGTACTGAA GAAGATAAGT TGCAGTACGG AGAGTTATAC
TGGCATAGAG TTCATCAAAT TTCTGGAGTA CTGGTATGCC CAAAACATTA TGTTCCTTTA
TATAATAGCC TGGTACCGGT CAGGGGATAT AATAAATACC AATATAAGGC TGCTAGCGAA
GAGAACTGTG TAAAGCCAGA TATTAATGTG ATATACGCTG ATGATGTTTT TGAAAAACTG
GTTAGGCTTG CAGAGGATGC TCAGGTTTTA CTTAACAGTG ATTTTGAAAA GAGAAATATA
GAATGGTATA AAGAACAGTA TCTTGCAAAG ATGATGGAAA TGGGATTTGC AACTGTGAAT
GGGAAAGTGT ACCGAAAGAG GCTTATAAAA GAGTTTATTA ATTATTATGG TGAAGAATTT
TTAGATATGG TGCAGTCCAG TGTTGATGTA GATAACGATT CAAATTGGCT AATGGATATG
ATAAGAAAGA AAAACAAGAC TGCTCATCCA ATAAGACATT TGCTTTTGTC ACAGTTTCTT
GGTATTTCAC TTCAAGATTT ATTTAATAAA AAGATGGAAT ATAAGCCTTT TGGAGATGGA
CCATGGCCTT GCTTGAATGC AGCATCAGAC CACTACCTAA AGCCGGTTGT TTCTGATTTA
AAAGTTGCAT ACAGTACGGA TTCAAAATGT CCTGTTGGGA CTTTTTCTTG TACTTGCGGT
TTTGTTTATA CAAGAAGTGG GCCAGATGAG TCTGAAGATG CAAGATATAG GTTCGGAAGG
ATAAAAAAGT TTGGACAAGT ATGGGAGGAG AAACTCAAAG AATTAGTAGA CCGAAAATTG
AGTTTAAGAG AAACAGTAAG GTTATTAGGG GTAGACCCTA TTACAGTTAA GAAGTATGCT
AAGAAACTTG GATTAACGAC TTACTGGGAA AAGCGGAGTG AGGCTGATGT TGTATATGAT
TATGAAAAGA GCAGTTATTC TTCAATGAAA TTGGATGATA AGGATTACTA CAGAAAAAGG
TGGAATGAAT TAAGAAAGCA ATATCCAGAG ATGGGAAAAA CGCAATTACG ACAGGTTGAT
AAGGCTTTAT TTGCCTGGCT TTATAGAAAT GACAGGGAAT GGCTAAACCA GAACTCACCT
GATAAAAAAG TGTCTAATAC TGTAAACAGA AGAGTTGATT GGAATCAAAG AGATAATGAG
ATATTGTCTC AGATAAAAGA AATAGTTGAT AAGATGCTGA ATTCGGACGA AAAGCCTGAA
AGGATTACTA TTAGTCTAAT TGGTAGTAAA TTAGGTATAA GAGGCTTGCT TGAAAAGCAT
TTAGATAAAC TTCCAAAGAC AAAAGCATAC CTGGATTCTG TTAAGGAGAC CAATCATGAC
TTCAGGTTAA GAAGGTTTCG CTGGGCAGTT AAGGAATTAG AGAAAGAAGG AGAAGAACTG
CAACTGTGGA AGATTATGAG GAAGGCTGGG ATAAGGAATA TATATCAAAT TTCAATCCAA
TTTGAAGGAA GTGGTAATTT TAAATAG
 
Protein sequence
MMTFFPVPYE DEVLYSVLAR YHVRSGNISY KATMRDLFGS TSVTAVMDLP SNIHNLVNNM 
PLNSRYTEEY LIKNHTLFPF YSAFLPPERA EQVFQSMKGE NGGSIYTRTG IMASSIVLNQ
YFKFCPTCTE EDKLQYGELY WHRVHQISGV LVCPKHYVPL YNSLVPVRGY NKYQYKAASE
ENCVKPDINV IYADDVFEKL VRLAEDAQVL LNSDFEKRNI EWYKEQYLAK MMEMGFATVN
GKVYRKRLIK EFINYYGEEF LDMVQSSVDV DNDSNWLMDM IRKKNKTAHP IRHLLLSQFL
GISLQDLFNK KMEYKPFGDG PWPCLNAASD HYLKPVVSDL KVAYSTDSKC PVGTFSCTCG
FVYTRSGPDE SEDARYRFGR IKKFGQVWEE KLKELVDRKL SLRETVRLLG VDPITVKKYA
KKLGLTTYWE KRSEADVVYD YEKSSYSSMK LDDKDYYRKR WNELRKQYPE MGKTQLRQVD
KALFAWLYRN DREWLNQNSP DKKVSNTVNR RVDWNQRDNE ILSQIKEIVD KMLNSDEKPE
RITISLIGSK LGIRGLLEKH LDKLPKTKAY LDSVKETNHD FRLRRFRWAV KELEKEGEEL
QLWKIMRKAG IRNIYQISIQ FEGSGNFK
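
The sequences in this section are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The sketch below shows one possible way to strip that layout and compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo input is only the first line of the gene sequence, so its GC value is not expected to match the whole-gene figure.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demo input.
first_line = "ATGATGACGT TTTTTCCTGT GCCATATGAG GATGAAGTAT TATACAGCGT CCTTGCAAGA"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after cleaning
print(round(gc_percent(seq), 1))   # GC percentage of this line only
```

To check the whole gene, the full Gene sequence block would be passed to `clean_sequence` in place of `first_line`.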