Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0459 |
Symbol | |
ID | 4808387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 572939 |
End bp | 574051 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640105873 |
Product | DNA protecting protein DprA |
Protein accession | YP_001036890 |
Protein GI | 125972980 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000016133 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAGTA ATTTGAAAGA GTGGGTATGG CTAAGTTCCA TTCCCGGAAT TGGAGCAGTC AAGTCCAGAA AACTTCTGGA GCATTTTGGG GATATACATA ATGTTTGGAG TGCCACTGCG GCTGAATTGG CTGTGCTTCC TTTTTTAAAC AGGAAAGATA TTTTAAATTT AACCAATGTA AAATTCAAGC AAGACGTTGA GCGGCATCTT GAAAATATCC ACAAAAATGA TATCAAGGTT ATTACATTGG AAGATGAATT GTATCCTGCG TACCTTAAAA ACATATATGA TCCTCCTTTG GTTCTTTATA TGAAGGGAAC CATTCAAGAG GAGGAAAAAT ATCTGGCTGT GGTTGGTTCA AGAAGAGCGA CGTCCTATGG ACTGGATATG GCGAAGAAAA TATCCCGAGA GCTTGCTGAA TGCGGTATTA CCGTTGTAAG CGGCATGGCG AGGGGAGTTG ATTCTTTTGC CCATATGGGA GCTCTTGAAG TAAAAGGAAG GACCATAGCC GTTTTAGGGT GCGGTCTTGA TATAGTATAT CCATACGAAA ATAAAAAACT TATGGAAAAT ATAATTGAAA GCGGTGCCTG CCTGTCCGAG TACCTTCCGG GTACTACGCC GGTGCCGGGC AATTTTCCCG CGCGCAACAG GATTATCAGT GGTATTTCAC TGGGAGTTGT TGTAATAGAG GCGGGAGAGC GCAGCGGTTC CTTGATTACG GCAAATTTTG CTTTGGAGCA GGGAAGAGAA GTTTTTGCGC TTCCGGGAAA TGTCAACAGT ATTAAAAGTA CCGGAACCAA TAAATTAATA AAAGAAGGGG CAAAAATAGT AACAGGAATT GACGATATAT TAGAAGAATT AAATATTTAT TTCATCGAAG AAAATACAAA AGTCTCCTTT AACAAAAATC TTCAGGATGA AAGAATTTTA AGAGGCCTTG ACAATGATGA AAAGAAAGTT GTGGAATGTT TGAAACTTGA GTCAATGCAT ATTGACAATA TTGCAAGAAA AACCGGATTT GGCATACAGC TTGTTAATTC AATACTTGTA ATGCTCGAAT TAAAGGGAGT TGTTGAGCAG CTTCCAGGAA AGATATTCAA GTTAAAGTTG TAA
|
Protein sequence | MESNLKEWVW LSSIPGIGAV KSRKLLEHFG DIHNVWSATA AELAVLPFLN RKDILNLTNV KFKQDVERHL ENIHKNDIKV ITLEDELYPA YLKNIYDPPL VLYMKGTIQE EEKYLAVVGS RRATSYGLDM AKKISRELAE CGITVVSGMA RGVDSFAHMG ALEVKGRTIA VLGCGLDIVY PYENKKLMEN IIESGACLSE YLPGTTPVPG NFPARNRIIS GISLGVVVIE AGERSGSLIT ANFALEQGRE VFALPGNVNS IKSTGTNKLI KEGAKIVTGI DDILEELNIY FIEENTKVSF NKNLQDERIL RGLDNDEKKV VECLKLESMH IDNIARKTGF GIQLVNSILV MLELKGVVEQ LPGKIFKLKL
|
| |