Gene Information |
Locus tag | Cthe_2454 |
Symbol | |
ID | 4809833 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2927793 |
End bp | 2930921 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640107868 |
Product | fibronectin, type III |
Protein accession | YP_001038849 |
Protein GI | 125974939 |
COG category | |
COG ID | |
TIGRFAM ID | |
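
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the usual convention for such records, but not stated on this page) and that the CDS ends in a single stop codon that is not counted in the protein length. The constant and function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2927793
END_BP = 2930921
GENE_LENGTH_BP = 3129
PROTEIN_LENGTH_AA = 1042


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 2930921 - 2927793 + 1 gives the listed 3129 bp, and 3129 / 3 - 1 gives the listed 1042 aa.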
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTAA TCATAGATTA CTTCAAAAAA TTGACCGTAT TCATTCTTAT CATTCTGATA ACAACTTTTT GCCTTAGAAA CACTGCACTG GCTGATTTCA CTCTTCACGC CGCTCCCGCG GATTTCAACT CCGACACTCA AATAAATCTT AACTGGACTT CGGTAACAAA TGCGGTCTAT TACAGTATTG TCAGAAACGA CGTGCATATC GCCAATATAG ACATTGACCT TGTGAGAAAC TATCTGAGCT TTCAGGACAC AGGACTTGTG CCTCAGACCT CCTACAGCTA TACAGTAACC GCTGTGGATT CCAACGGAAG GTCCATTAAA TCTGCAAGCA GCACGGTTTC CACACTGCAA ATGAAAGCTC CTTCCATCGT TTCTTCATGT CTGGACATTA ATACCAACGA AATTACGCTG ACATGGACAA ACAACTCCCT TGCCGTAAAC GGTACAATAG TAAACAAATC CGGTGAAGGA CAAATTGCCG AAGTTTCCGG CAAAAACACC TCTGTAACTT TCGTTGATCC AAACCTGACT TACGGCGTCG AGGCGCAATA TACAATTATG TCAATAGACG GAAATGGGCA CTCATCCCCA TCTTCCGATC CTGTTTCAAT AATACCGATA GTTCCGCCGG TGATAGAAGC TTCAATAAAA AACAGCACAG TAACAATTTC GTGGCAGCCG CACGACCACA TTCACGAATT CAGACTTGAA AGAGCCAAAT ACCTGGAAGA TAAAAAGAAA TGGGGCTCAT GGGAAATAAT AAAAACGAGT CTTGCGAAAA ACAGTACCAG TACCACTGAC AGCATCAGCA GCGATGGAAC GTATAGATAC AGGCTCAGCA TCGACAATGG AAAATATAAA GGTTACAGTA ACATATCAAA TCCCGTGGCA AGGCTTTTGG CTCCTACAAA CATTCAATGC GTCCCCGTAG ATTCCGGAAG AATCGACATA AGCTGGAGCC TTCCTTCAAA AGGCAACTTT ACCCTTAAAA TAGAGAAAAG AATTGATTCG GGAAGCTATT ACACATTAGC CGTCCTTGAC AGCAATATAA CATCTTATTC CGACACAGAG GGAATTCTTC CAAACAAATA TTACTACTAC CGGATAACAG CATATGATGA AAACGGCAAC TCTGCTTCAT CTCCTGTGTA TTCCATATAC ACAGGTAAGC CGTCTGCAGC ATCAAACTTA ATGCTGGAGA TTACATCTCC CACAAAAATA ACCCTGAACT GGAAAGATTC TTCCAGCAAT GAATCAGGCT TCAGAATTGA AAGAAAAGTT GATACAGGCA CCTTTGTTCC CATTGCAACT TTGCCTGCAA ATACCACTTC TTATGTAGAC AACAACGTAA ACCCTCAGTC CACTTATACT TACAGAGTCA TGTCTTTCAA CTCAATAGGA GATGCTTCAT CTTACAGCAA TGAAGTTTCC TGCACAACAT CAATCTTGAA AGGACCTCCT TCGTCGCTGA CAGTTACACC GTTGTCTGCA AATGAGATTG AGCTTGGCTG GACATATGCA GGCTCCGCAA GTTACAGTAC CATTATTGAA AGAAAAACAG GCGACGGCAG TACCTGGGAA ATGGTAACCA CACTGCCGGC AGGCTATACA AGTTACAGAG ATACGGACCT TCTTCCCAAC ACCCGGTATT TTTACAGAGT AAGGACAAAT CTTGCCAGCA GGGTTTACTC CAGGCCTTAT CCTGAAAAAG CTGACGACAC CGGTGTATAT ACTTATTTTG AAACTCCGGA GAAGTTAACA TCCGCCTGGA CTGTTTCCGG GTTTGTAAAA 
CTTTCCTGGA AGTATGAGCC CTGGGATGAT GAGGAAATTA TAATAGAACG CAAAACAGGG AACGGAAAAT TCATAGAAAT CGGAAGACAG GATGCGGATA AAACCGTATG GTATGATTAC GACACAGACC CTGAAACTGA TTATACTTAC AGAATCAAAG CCGTCAACGC TTACAACTCG TCGGAGTATT CAAATGAATC TTCGGTCGAC GCCATAAGCT TTAAACCGCC GGAAAACCTC TCTGCAACCA TTGTATCAAA TACCGAGATA ATCCTAAGCT GGACCGACAT GACCGAGGAT GAGTCCGGCT TCAGGGTGGA AATGAAAGAA GGAAACAGCG AAAACTGGAA AAAGGTCGTC TCACTTTCCA AAAACACCAC CAGCTATACA ATCAAAGAAT TAAAACCGGA CACTCTTTAC CATTTCAGAG TGGTCGCGGA AAAGTCGGCT TACCGTTTTG AGGTGTACAG CGAGGAAATT CAGGTGCTTA TGAAATCTTT GGCCGCGCCG TCGAACCTTG TGGTAAGAGC TGTTTCTCCA AATCAAATTG TTCTGGAATG GAAGGACAAT TCCGATGATG AAGAAGGCTT TGTCATTGAA AGAAAATCAA ACAATGGAGA TTTCACTGAA ATTGCAAAAG TAGCCAAAAA TATCACCAAA TTTACAGATA ATCAGCTTTA TGCCAATACC ACATATTACT ATAGAGTCAA AGGTTATTAT AAAAACACAT ACACCAATTA TACCAATATC GGAATGGCCA AAACCGCCCT TTCCAAAACC TTTAACGATT TAAACTCCGT TCCCTGGGCA AAAGAAGCCA TAGAAAGTCT GGCGGCAAGA GGCATTATAC ACGGAAAATC GGAAGAGCAG GGAATATTCG CACCCAATGA CCGAATAACC CGGGCAGAAT TTATAACACT TATAGTAAAT GCATTCCAGC TTAGCAAGAC CCCGACAGGT ACTTTTGCCG ATGTAAAGCC CAATCATTGG TATTACAGAA ATGTCATGAT TGCCAAAAAC ATGGGCATTG TTTCAGGAAC GGGCAACAAC TACTTTTATC CCGATGATCC TATAAAAAGG GAAGACATGG CAGTAATTCT GGCAAAAACT TTTAAAATCA TAGGAAAGCC ACTGCCAAAT CACAGTGATT CAGTTTTAGA CAAATATTCA GATAAAAATC TTATTTCCAT ATATGCTCTG CAGAGCATGG CCATACTAAA CGGAGAAGGC ATTATAACCG GCAAAAGCAG CTCACAGTTG TCCCCTAAAG ACTACGCCAC AAGAGCGGAG GCGGCAGTAA TATTGTACAA AGCTTTAAAC AAACTTTAA
|
Protein sequence | MKLIIDYFKK LTVFILIILI TTFCLRNTAL ADFTLHAAPA DFNSDTQINL NWTSVTNAVY YSIVRNDVHI ANIDIDLVRN YLSFQDTGLV PQTSYSYTVT AVDSNGRSIK SASSTVSTLQ MKAPSIVSSC LDINTNEITL TWTNNSLAVN GTIVNKSGEG QIAEVSGKNT SVTFVDPNLT YGVEAQYTIM SIDGNGHSSP SSDPVSIIPI VPPVIEASIK NSTVTISWQP HDHIHEFRLE RAKYLEDKKK WGSWEIIKTS LAKNSTSTTD SISSDGTYRY RLSIDNGKYK GYSNISNPVA RLLAPTNIQC VPVDSGRIDI SWSLPSKGNF TLKIEKRIDS GSYYTLAVLD SNITSYSDTE GILPNKYYYY RITAYDENGN SASSPVYSIY TGKPSAASNL MLEITSPTKI TLNWKDSSSN ESGFRIERKV DTGTFVPIAT LPANTTSYVD NNVNPQSTYT YRVMSFNSIG DASSYSNEVS CTTSILKGPP SSLTVTPLSA NEIELGWTYA GSASYSTIIE RKTGDGSTWE MVTTLPAGYT SYRDTDLLPN TRYFYRVRTN LASRVYSRPY PEKADDTGVY TYFETPEKLT SAWTVSGFVK LSWKYEPWDD EEIIIERKTG NGKFIEIGRQ DADKTVWYDY DTDPETDYTY RIKAVNAYNS SEYSNESSVD AISFKPPENL SATIVSNTEI ILSWTDMTED ESGFRVEMKE GNSENWKKVV SLSKNTTSYT IKELKPDTLY HFRVVAEKSA YRFEVYSEEI QVLMKSLAAP SNLVVRAVSP NQIVLEWKDN SDDEEGFVIE RKSNNGDFTE IAKVAKNITK FTDNQLYANT TYYYRVKGYY KNTYTNYTNI GMAKTALSKT FNDLNSVPWA KEAIESLAAR GIIHGKSEEQ GIFAPNDRIT RAEFITLIVN AFQLSKTPTG TFADVKPNHW YYRNVMIAKN MGIVSGTGNN YFYPDDPIKR EDMAVILAKT FKIIGKPLPN HSDSVLDKYS DKNLISIYAL QSMAILNGEG IITGKSSSQL SPKDYATRAE AAVILYKALN KL
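
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: it builds the codon table by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch ignores). The function name `translate` and the short excerpts are illustrative; the excerpts are the first 30 and last 9 bases of the gene sequence above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAAATTAA TCATAGATTA CTTCAAAAAA"))  # first 10 codons
    print(translate("AAACTTTAA"))                         # last 3 codons
```

The first excerpt translates to `MKLIIDYFKK` and the second to `KL`, matching the start and end of the protein sequence listed above. Passing the full gene sequence (both lines joined) to `translate` should yield the complete 1042-residue protein under the same assumptions.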