Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3214 |
Symbol | |
ID | 4809516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3806501 |
End bp | 3808486 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 640108648 |
Product | hypothetical protein |
Protein accession | YP_001039602 |
Protein GI | 125975692 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCAATA ATGAAAATGT TGTGTATAAA TCAAAAGCAC CATATAACTT TATTGGATTG GAAGACTTCA TTTTGGATAA ATGTAGTGAT GAAGAAGAAC TTTTAATGCA TGAAAGGTAT CATGAAGGTT TGAAGACAGG ATGCATTCAG TATGAAATCG AGGTAATAAC ACCTCTTCAT ATTTCTGCAG GTAAAAAAGA ATCTGGTAAA GAGAAAGAAG ACAGTGAGGA AGAAAGTACC AGAGAAGAAG AACTCTTTAA AAATCCTCTT GGGCAATATG TTATTCCGGG GAATACAATT CGTGGCCTCA CAAGGTACAA CGCTTCCATA TTTTCTTTTG CGTCAGTGAT AAATGAGCCA AAAGGAAAGA AAGATATTGA AAACAAAAGA TTTTTTTACA GAACTTTTGC ATCCAAAGAT GCAAATATAA GAAAGTGGTA TTCTGATACT TTGGGAATGA GGCTGCGTCA AAGAGGAAGT TTTAAATACA CTGTTCTTGA AAAGGTTCAT GCCGGTTACA TACGTAAAAG GGGAGAGGAA TATGTTATTA CTCCTGCAGT TAAGATAGGA AACAAAAATG TGGACGATGA AAGCAGAATG AAATCGTATA TGCCAATACA TGAGTGGGAA CTAAGAAATT TAAACGTTCA AGGGGTATAT TATTTATACA ATGACAATCT CAAAAAAGAA GATTACAAAA GTGATTCGAC TTTAAGAGAA AACCGTAACA GTCAGTACAG ACCATACTTT ATAGAAGTAA GATATAATGT TGCGGAGGGA AAACCTAAAA TTGATATTAA CGGAAAGTTC AAAGGTATGC TTGCCAATTC AAATTATATA AATAGCAAGA GGCACCACTA TTTAATATTT GAGGAAGATA AAAACAGTGC TGAAATAGTT GTATCAAAAA AATTGGCAGA ACTTTACACA GATGATTTGA AGTACACGCA AGAGCGCAAT GCTGCAAGCA AAGAGATTTA TAAAGAATAT TATGAACTTC CCAAAGAAGG AGAAGTAAAA CCGGTTTTCT TTGTAAAAGA AGGTGACAGG CTTATTTTTG GTTTTACTCC GTATCTGAGA ATTCCGGCCG AAGGAGATAT ATACGGCGGA ATACCGGAGG TTCACAAAAA CTATACCGGA ACGGATTTTG TTGATGCCAT GTTTGGATGG AGAGATTTTA GAACGAAGCT TTCGTTTTTG GATGCTGTTT GTGAAAGCCC AAATCCTGAA ATAACAAAAG AGTATGAAAT GATGTTGGCG GAGCCAAAAC CGTCATGGTA TAAAGGTTAT CTGAAACAAA GAAAAGACAC TTTGGAATCG TATAGCACAG AAGGGTACGA AATAAGAGGA CGGAAGTTTT ACTGGATGAA AGAAAGCCTT GATATTAAGG GAATGGAAGA ACAAGAAAAG AAAGAAAGAC TTGTTACAAG GATGAAGTGC TATGCTGAAA ATACTAAATT TATAGGAAAA GTTAAATTTG AAAATTTGAC AGATGAGGAA TTGGGACTTT TAATATACTC ATTAAAACAA GGAGACACGG AAGGGTACTT TAACCTGGGT AAAGGAAAAC CTTATGGATT CGGTAAATGC AGGATACGGA TTTTGGGACT TTTTGTGGAA AATATAAAAG AAAAGTACAC TTCTTTTAAT GCTAATTATC TCAAGGAGGA AAAGTCAGAC AAATATGTGG AAGCTTTTAG GAAATACATA ATTGAACATT ACAGAAAAAA GGTGAGCAAT GTAAATGAAA TTATAAGTTA CAGGGAGTTT GAACTTAGTA AAAAGATTTT GAAAAATTCA GAAACCAGAT ATATGAAGGT GGGAGAGTTT GCTCAAAGAG CTGAGCTGCC TATGTTGGAA GATGCTGTCC GAGAGACCGG TAATGTAATA TCTACAAAAA TGAAAGGGGC AGAGAAGAAA AATAGTAATC ACAAGGTAAA AAGTGACAAG CGGAAGAACC ATAATGAATC AAATAAAACA ACATAA
|
Protein sequence | MGNNENVVYK SKAPYNFIGL EDFILDKCSD EEELLMHERY HEGLKTGCIQ YEIEVITPLH ISAGKKESGK EKEDSEEEST REEELFKNPL GQYVIPGNTI RGLTRYNASI FSFASVINEP KGKKDIENKR FFYRTFASKD ANIRKWYSDT LGMRLRQRGS FKYTVLEKVH AGYIRKRGEE YVITPAVKIG NKNVDDESRM KSYMPIHEWE LRNLNVQGVY YLYNDNLKKE DYKSDSTLRE NRNSQYRPYF IEVRYNVAEG KPKIDINGKF KGMLANSNYI NSKRHHYLIF EEDKNSAEIV VSKKLAELYT DDLKYTQERN AASKEIYKEY YELPKEGEVK PVFFVKEGDR LIFGFTPYLR IPAEGDIYGG IPEVHKNYTG TDFVDAMFGW RDFRTKLSFL DAVCESPNPE ITKEYEMMLA EPKPSWYKGY LKQRKDTLES YSTEGYEIRG RKFYWMKESL DIKGMEEQEK KERLVTRMKC YAENTKFIGK VKFENLTDEE LGLLIYSLKQ GDTEGYFNLG KGKPYGFGKC RIRILGLFVE NIKEKYTSFN ANYLKEEKSD KYVEAFRKYI IEHYRKKVSN VNEIISYREF ELSKKILKNS ETRYMKVGEF AQRAELPMLE DAVRETGNVI STKMKGAEKK NSNHKVKSDK RKNHNESNKT T
|
| |