Gene Cthe_2454 details

Gene Information

Locus tag: Cthe_2454
Symbol:
ID: 4809833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2927793
End bp: 2930921
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 42%
IMG OID: 640107868
Product: fibronectin, type III
Protein accession: YP_001038849
Protein GI: 125974939
COG category:
COG ID:
TIGRFAM ID:
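
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 2927793
END_BP = 2930921
GENE_LENGTH_BP = 3129
PROTEIN_LENGTH_AA = 1042


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues encoded by an unspliced, complete CDS.

    Assumes the gene length is a whole number of codons and that the
    final codon is a stop codon, which contributes no amino acid.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1
```

With the values from this record, `span_length(START_BP, END_BP)` gives 3129 and `expected_protein_length(GENE_LENGTH_BP)` gives 1042, matching the Gene Length and Protein Length fields.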


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAAATTAA TCATAGATTA CTTCAAAAAA TTGACCGTAT TCATTCTTAT CATTCTGATA 
ACAACTTTTT GCCTTAGAAA CACTGCACTG GCTGATTTCA CTCTTCACGC CGCTCCCGCG
GATTTCAACT CCGACACTCA AATAAATCTT AACTGGACTT CGGTAACAAA TGCGGTCTAT
TACAGTATTG TCAGAAACGA CGTGCATATC GCCAATATAG ACATTGACCT TGTGAGAAAC
TATCTGAGCT TTCAGGACAC AGGACTTGTG CCTCAGACCT CCTACAGCTA TACAGTAACC
GCTGTGGATT CCAACGGAAG GTCCATTAAA TCTGCAAGCA GCACGGTTTC CACACTGCAA
ATGAAAGCTC CTTCCATCGT TTCTTCATGT CTGGACATTA ATACCAACGA AATTACGCTG
ACATGGACAA ACAACTCCCT TGCCGTAAAC GGTACAATAG TAAACAAATC CGGTGAAGGA
CAAATTGCCG AAGTTTCCGG CAAAAACACC TCTGTAACTT TCGTTGATCC AAACCTGACT
TACGGCGTCG AGGCGCAATA TACAATTATG TCAATAGACG GAAATGGGCA CTCATCCCCA
TCTTCCGATC CTGTTTCAAT AATACCGATA GTTCCGCCGG TGATAGAAGC TTCAATAAAA
AACAGCACAG TAACAATTTC GTGGCAGCCG CACGACCACA TTCACGAATT CAGACTTGAA
AGAGCCAAAT ACCTGGAAGA TAAAAAGAAA TGGGGCTCAT GGGAAATAAT AAAAACGAGT
CTTGCGAAAA ACAGTACCAG TACCACTGAC AGCATCAGCA GCGATGGAAC GTATAGATAC
AGGCTCAGCA TCGACAATGG AAAATATAAA GGTTACAGTA ACATATCAAA TCCCGTGGCA
AGGCTTTTGG CTCCTACAAA CATTCAATGC GTCCCCGTAG ATTCCGGAAG AATCGACATA
AGCTGGAGCC TTCCTTCAAA AGGCAACTTT ACCCTTAAAA TAGAGAAAAG AATTGATTCG
GGAAGCTATT ACACATTAGC CGTCCTTGAC AGCAATATAA CATCTTATTC CGACACAGAG
GGAATTCTTC CAAACAAATA TTACTACTAC CGGATAACAG CATATGATGA AAACGGCAAC
TCTGCTTCAT CTCCTGTGTA TTCCATATAC ACAGGTAAGC CGTCTGCAGC ATCAAACTTA
ATGCTGGAGA TTACATCTCC CACAAAAATA ACCCTGAACT GGAAAGATTC TTCCAGCAAT
GAATCAGGCT TCAGAATTGA AAGAAAAGTT GATACAGGCA CCTTTGTTCC CATTGCAACT
TTGCCTGCAA ATACCACTTC TTATGTAGAC AACAACGTAA ACCCTCAGTC CACTTATACT
TACAGAGTCA TGTCTTTCAA CTCAATAGGA GATGCTTCAT CTTACAGCAA TGAAGTTTCC
TGCACAACAT CAATCTTGAA AGGACCTCCT TCGTCGCTGA CAGTTACACC GTTGTCTGCA
AATGAGATTG AGCTTGGCTG GACATATGCA GGCTCCGCAA GTTACAGTAC CATTATTGAA
AGAAAAACAG GCGACGGCAG TACCTGGGAA ATGGTAACCA CACTGCCGGC AGGCTATACA
AGTTACAGAG ATACGGACCT TCTTCCCAAC ACCCGGTATT TTTACAGAGT AAGGACAAAT
CTTGCCAGCA GGGTTTACTC CAGGCCTTAT CCTGAAAAAG CTGACGACAC CGGTGTATAT
ACTTATTTTG AAACTCCGGA GAAGTTAACA TCCGCCTGGA CTGTTTCCGG GTTTGTAAAA
CTTTCCTGGA AGTATGAGCC CTGGGATGAT GAGGAAATTA TAATAGAACG CAAAACAGGG
AACGGAAAAT TCATAGAAAT CGGAAGACAG GATGCGGATA AAACCGTATG GTATGATTAC
GACACAGACC CTGAAACTGA TTATACTTAC AGAATCAAAG CCGTCAACGC TTACAACTCG
TCGGAGTATT CAAATGAATC TTCGGTCGAC GCCATAAGCT TTAAACCGCC GGAAAACCTC
TCTGCAACCA TTGTATCAAA TACCGAGATA ATCCTAAGCT GGACCGACAT GACCGAGGAT
GAGTCCGGCT TCAGGGTGGA AATGAAAGAA GGAAACAGCG AAAACTGGAA AAAGGTCGTC
TCACTTTCCA AAAACACCAC CAGCTATACA ATCAAAGAAT TAAAACCGGA CACTCTTTAC
CATTTCAGAG TGGTCGCGGA AAAGTCGGCT TACCGTTTTG AGGTGTACAG CGAGGAAATT
CAGGTGCTTA TGAAATCTTT GGCCGCGCCG TCGAACCTTG TGGTAAGAGC TGTTTCTCCA
AATCAAATTG TTCTGGAATG GAAGGACAAT TCCGATGATG AAGAAGGCTT TGTCATTGAA
AGAAAATCAA ACAATGGAGA TTTCACTGAA ATTGCAAAAG TAGCCAAAAA TATCACCAAA
TTTACAGATA ATCAGCTTTA TGCCAATACC ACATATTACT ATAGAGTCAA AGGTTATTAT
AAAAACACAT ACACCAATTA TACCAATATC GGAATGGCCA AAACCGCCCT TTCCAAAACC
TTTAACGATT TAAACTCCGT TCCCTGGGCA AAAGAAGCCA TAGAAAGTCT GGCGGCAAGA
GGCATTATAC ACGGAAAATC GGAAGAGCAG GGAATATTCG CACCCAATGA CCGAATAACC
CGGGCAGAAT TTATAACACT TATAGTAAAT GCATTCCAGC TTAGCAAGAC CCCGACAGGT
ACTTTTGCCG ATGTAAAGCC CAATCATTGG TATTACAGAA ATGTCATGAT TGCCAAAAAC
ATGGGCATTG TTTCAGGAAC GGGCAACAAC TACTTTTATC CCGATGATCC TATAAAAAGG
GAAGACATGG CAGTAATTCT GGCAAAAACT TTTAAAATCA TAGGAAAGCC ACTGCCAAAT
CACAGTGATT CAGTTTTAGA CAAATATTCA GATAAAAATC TTATTTCCAT ATATGCTCTG
CAGAGCATGG CCATACTAAA CGGAGAAGGC ATTATAACCG GCAAAAGCAG CTCACAGTTG
TCCCCTAAAG ACTACGCCAC AAGAGCGGAG GCGGCAGTAA TATTGTACAA AGCTTTAAAC
AAACTTTAA
 
Protein sequence
MKLIIDYFKK LTVFILIILI TTFCLRNTAL ADFTLHAAPA DFNSDTQINL NWTSVTNAVY 
YSIVRNDVHI ANIDIDLVRN YLSFQDTGLV PQTSYSYTVT AVDSNGRSIK SASSTVSTLQ
MKAPSIVSSC LDINTNEITL TWTNNSLAVN GTIVNKSGEG QIAEVSGKNT SVTFVDPNLT
YGVEAQYTIM SIDGNGHSSP SSDPVSIIPI VPPVIEASIK NSTVTISWQP HDHIHEFRLE
RAKYLEDKKK WGSWEIIKTS LAKNSTSTTD SISSDGTYRY RLSIDNGKYK GYSNISNPVA
RLLAPTNIQC VPVDSGRIDI SWSLPSKGNF TLKIEKRIDS GSYYTLAVLD SNITSYSDTE
GILPNKYYYY RITAYDENGN SASSPVYSIY TGKPSAASNL MLEITSPTKI TLNWKDSSSN
ESGFRIERKV DTGTFVPIAT LPANTTSYVD NNVNPQSTYT YRVMSFNSIG DASSYSNEVS
CTTSILKGPP SSLTVTPLSA NEIELGWTYA GSASYSTIIE RKTGDGSTWE MVTTLPAGYT
SYRDTDLLPN TRYFYRVRTN LASRVYSRPY PEKADDTGVY TYFETPEKLT SAWTVSGFVK
LSWKYEPWDD EEIIIERKTG NGKFIEIGRQ DADKTVWYDY DTDPETDYTY RIKAVNAYNS
SEYSNESSVD AISFKPPENL SATIVSNTEI ILSWTDMTED ESGFRVEMKE GNSENWKKVV
SLSKNTTSYT IKELKPDTLY HFRVVAEKSA YRFEVYSEEI QVLMKSLAAP SNLVVRAVSP
NQIVLEWKDN SDDEEGFVIE RKSNNGDFTE IAKVAKNITK FTDNQLYANT TYYYRVKGYY
KNTYTNYTNI GMAKTALSKT FNDLNSVPWA KEAIESLAAR GIIHGKSEEQ GIFAPNDRIT
RAEFITLIVN AFQLSKTPTG TFADVKPNHW YYRNVMIAKN MGIVSGTGNN YFYPDDPIKR
EDMAVILAKT FKIIGKPLPN HSDSVLDKYS DKNLISIYAL QSMAILNGEG IITGKSSSQL
SPKDYATRAE AAVILYKALN KL
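
The protein sequence is the translation of the gene sequence under translation table 11, and the GC content field is the share of G and C bases in the gene sequence. The following is a minimal sketch of both calculations, not the database's own method. It assumes the sequences are pasted in as shown, in space-separated blocks of ten, and it uses the standard codon assignments, which table 11 shares for codons after the start. Table 11 also allows alternative start codons, which this sketch does not handle; this gene starts with ATG, so that does not matter here. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop spaces and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)
```

For example, `translate("ATGAAATTAA TCATAGATTA C")` returns `"MKLIIDY"`, the first seven residues of the protein sequence above, and `translate("AAACTTTAA")` returns `"KL"`, its last two residues followed by the stop codon.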