Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0316 |
Symbol | |
ID | 4808534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 398665 |
End bp | 400599 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640105727 |
Product | PA14 |
Protein accession | YP_001036747 |
Protein GI | 125972837 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.914268 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAACA TAGGAGTAAT CATTAAGATA GAAGGAAACG AAGCCATTGT AATGACCGAC GATTGCTCTT TCAAAAAGGT TCCGATAAAA GATGGAATGC ATCCGGGGCA AAAAATACTT GTGCCCAATA ATGAAGTTAT ACAGAAGGAA AATAAAAGCA TAAAGCGGAT TTCGGCTGTC GCGACCGGCA TTGCAGCCGT GTTTTTGATG GTGTTGTCGT TAATATGGAT TAACAAACCG GGCAGACCGG ATGGTATATA TGCATATATT GACGTTGATA TAAATCCCAG TTTAAACTTC CTGATTGACC GGGAGGGAAA GGTAAAGGCG TTAAACCCGT TAAATGATGA TGCGCAGGAA ATAATCCGTG GTGTTGAGTT TGAGGATATG TTTTTTTCAG AAGCCCTTAC GCAGATTATC AAGATATCAA AAGCCAAAGG TATTATAGAT GAAAACAAAA CCAATTATGT ACTGATTTGT GCAGCTTTGG ACGATAATTA CAATTTGCAA AGCGACGACA AATCCCGGGC GCAAACAGAG TTTGAAGAGT TTTTGGACGG TATTAGGGAA AGTATAGAGA AAGCCTGCGG CAATACGGTA ATTCCTCAAA CGGTAAAAGT ACCGTTTGAA TACTTAAAAA TGGCAAAGCA AAATGATGTA TCCATGGGAA GGTATCTGGT TTATCAGAAG TTGGAGGACA TTGGAGTGAA TTTGTCGATA GAAGAGCTGA AATCATTGGA TATCGATGAA ATATTAAAAA AATATGGTGT GGGTTTTGAT GAATTGTTCA AAAGTGAGTA TACGGAATTG CCGTATGGGA CTTTGCAAAC AGGAGAAGAT TCTGTTGTGT CTACAGAGGA TGTGCCGGTA TCGCCGAAAA ATGCATTTGA AACGATGGCT GTGCCGACAA ATACGCCTTC AATATCGACT AAACCTTCAG CAACCCCGGC GGAGAATCCG ACGCCAAAAT TAACGCAGAA ACCAACGCCT GTACCGGCAA AAACAGGTGA ACGTACAAGC ACAACGCCGA CACCGACACC GGCGCCAACC GTCAGAAACG GTACCGGCAG CGGACTTAGG GGAGAGTATT ACAATAATAT GGATTTTTCC CGTTTCCAGT TTGTGAGAAT TGATCCCTGT ATAGACTTTG ACTGGGGTGA AGGCACACCG GATCAATCCA TCGGAAAGGA TACCTATTCT GTCAGATGGA CAGGGAAGGT TGAACCTAGA TATTCGGAAA CATACACATT TTATACTGTT ACCGATGACG GTGTGAGATT GTGGGTAGAC GGAGTGCTGC TCATTGACAA GTGGAAGAGC CAGTCGGCTA CTGAACACAG CGAGCAAATT TATCTCGAGG CCGGAAAGAA ATATGATATT AAAATGGAGT ATTACCAGCA TGTCCGGGCT GCTTCGGCAA AACTTATGTG GTCAAGCAAG AGCCAGCAAA AGGAGATAAT ACCTTCAAGT CAACTGTATC CTTCCGACGG CCCGCTGCCT CAGAAGGATG TAAACGGTTT GAGTGCGGAA TATTACGGGG ATGCGGAGTT GAAAGACAAG AGATTTACCA GAATAGACGA TGCTATAAAC TTTAACTGGG ATAAGGATTT TCCGGTTGGT GAATTGAAAG ACGGAAAGTT TTCGGTAAGA TGGGTGGGAA AAATAGACAC CAGATATACC GAAGAGTATA CGTTCCATAC TGTTGCAAAC GGAGGAGTAA GGGTATGGAT AAATAATGTG TTGATAATTG ACAATTGGCA AAATCAGGGC AAAGAAGCTG AAAACAGCGG AAAAATTGAA TTAAAGGCAG GAAGGCAGTA TGATATTAAA GTTGAGTATT GCAACTACGG AGAACCTGCA TTCATAAAGC TTTTATGGTC CAGTCAAAGA CAGAAAAAAG AGGTGGTTCC TTCAAAAAAT TTGTTTGCAG ATTAA
|
Protein sequence | MDNIGVIIKI EGNEAIVMTD DCSFKKVPIK DGMHPGQKIL VPNNEVIQKE NKSIKRISAV ATGIAAVFLM VLSLIWINKP GRPDGIYAYI DVDINPSLNF LIDREGKVKA LNPLNDDAQE IIRGVEFEDM FFSEALTQII KISKAKGIID ENKTNYVLIC AALDDNYNLQ SDDKSRAQTE FEEFLDGIRE SIEKACGNTV IPQTVKVPFE YLKMAKQNDV SMGRYLVYQK LEDIGVNLSI EELKSLDIDE ILKKYGVGFD ELFKSEYTEL PYGTLQTGED SVVSTEDVPV SPKNAFETMA VPTNTPSIST KPSATPAENP TPKLTQKPTP VPAKTGERTS TTPTPTPAPT VRNGTGSGLR GEYYNNMDFS RFQFVRIDPC IDFDWGEGTP DQSIGKDTYS VRWTGKVEPR YSETYTFYTV TDDGVRLWVD GVLLIDKWKS QSATEHSEQI YLEAGKKYDI KMEYYQHVRA ASAKLMWSSK SQQKEIIPSS QLYPSDGPLP QKDVNGLSAE YYGDAELKDK RFTRIDDAIN FNWDKDFPVG ELKDGKFSVR WVGKIDTRYT EEYTFHTVAN GGVRVWINNV LIIDNWQNQG KEAENSGKIE LKAGRQYDIK VEYCNYGEPA FIKLLWSSQR QKKEVVPSKN LFAD
|
| |