Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2863 |
Symbol | |
ID | 4809143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3381065 |
End bp | 3382069 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640108282 |
Product | hypothetical protein |
Protein accession | YP_001039254 |
Protein GI | 125975344 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCACAG CACTTTCTTT TAGTATGGAT GGATTTGACA TGCTGGCAAT GGGAATATCG TTGTTTGAAC CATCCAATGC ATTGGTTGAA TTTAACCGGA AGCTGCATTC CAATGCACTT TATAACGGAT TCCAGATTGC TGTAAACGCG CTGGCTGTTT TCAGTGCCGG GGCGGCATCG ACAATGAAGT GCTTTGTTGC AGGTACAATG ATATTGACTG CGACAGGCTT GGTTGCGATA GAGAATATCA AGGCAGGGGA CAAGGTAATT GCGACGAATC CTGAAACTTT TGAAGTAGCC GAGAAGACAG TGCTTGAGAC ATATGTGAGA GATACGACGG AGCTTTTGCA TTTGACAATC AATGGAGAGG TAATCAAGAC AACCTTTGAG CATCCGTTTT ATGTAAAAGA TGTGGGTTTT GTTGAAGCGG GAAAACTGCA GATAGGAGAC AGGTTGGTTG ATTCAAGAGG TAATGTTTTA GTATTGGAAG GTAAAAAGCT TGAAATAACA GATAAGCCTG TAAAGGTTTA CAATTTTAAG GTTGATAATT TTCATACGTA TCATGTTGGC GAAAATAGGG TATTGGTTCA TAATGCGAAT AAGTATGTTA AGGGAACGCG TAGTACTGTA GGTAAACTTA CAGGTTCATT GGATGGGTTA ACATCAGCAG AAAGAAAGGT TGTAAATGAT TTGCTTTCAC AGGGTAAGAA TGTTGAAATA ATTCCGCGTT CCAATGTTCA AGGGGTTAGC ACACCTGATT TTATAATAAA TGGGGTAAAA ACAGAATTAA AAACATTAAA TGGAACAAGT CTAAATACTC CGGTTACTAG GATTACAGAT GCGTTTAAAC AAGGTGCAGA TGCAGTTATT ATTGATGCAA GAAATGTTGG AATAACTGCT GAACAGGCAA ACCAAATACT CAATCGAGCT GCAGGCACTT ATCAAAATAA AGTATTACCA GGTCAAGTTG AGATTTGGAC TGTTGACGGT ATTATTAGGA GGTAA
|
Protein sequence | MTTALSFSMD GFDMLAMGIS LFEPSNALVE FNRKLHSNAL YNGFQIAVNA LAVFSAGAAS TMKCFVAGTM ILTATGLVAI ENIKAGDKVI ATNPETFEVA EKTVLETYVR DTTELLHLTI NGEVIKTTFE HPFYVKDVGF VEAGKLQIGD RLVDSRGNVL VLEGKKLEIT DKPVKVYNFK VDNFHTYHVG ENRVLVHNAN KYVKGTRSTV GKLTGSLDGL TSAERKVVND LLSQGKNVEI IPRSNVQGVS TPDFIINGVK TELKTLNGTS LNTPVTRITD AFKQGADAVI IDARNVGITA EQANQILNRA AGTYQNKVLP GQVEIWTVDG IIRR
|
| |