Gene Cthe_0459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0459 
Symbol 
ID4808387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp572939 
End bp574051 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content39% 
IMG OID640105873 
ProductDNA protecting protein DprA 
Protein accessionYP_001036890 
Protein GI125972980 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000016133 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGTA ATTTGAAAGA GTGGGTATGG CTAAGTTCCA TTCCCGGAAT TGGAGCAGTC 
AAGTCCAGAA AACTTCTGGA GCATTTTGGG GATATACATA ATGTTTGGAG TGCCACTGCG
GCTGAATTGG CTGTGCTTCC TTTTTTAAAC AGGAAAGATA TTTTAAATTT AACCAATGTA
AAATTCAAGC AAGACGTTGA GCGGCATCTT GAAAATATCC ACAAAAATGA TATCAAGGTT
ATTACATTGG AAGATGAATT GTATCCTGCG TACCTTAAAA ACATATATGA TCCTCCTTTG
GTTCTTTATA TGAAGGGAAC CATTCAAGAG GAGGAAAAAT ATCTGGCTGT GGTTGGTTCA
AGAAGAGCGA CGTCCTATGG ACTGGATATG GCGAAGAAAA TATCCCGAGA GCTTGCTGAA
TGCGGTATTA CCGTTGTAAG CGGCATGGCG AGGGGAGTTG ATTCTTTTGC CCATATGGGA
GCTCTTGAAG TAAAAGGAAG GACCATAGCC GTTTTAGGGT GCGGTCTTGA TATAGTATAT
CCATACGAAA ATAAAAAACT TATGGAAAAT ATAATTGAAA GCGGTGCCTG CCTGTCCGAG
TACCTTCCGG GTACTACGCC GGTGCCGGGC AATTTTCCCG CGCGCAACAG GATTATCAGT
GGTATTTCAC TGGGAGTTGT TGTAATAGAG GCGGGAGAGC GCAGCGGTTC CTTGATTACG
GCAAATTTTG CTTTGGAGCA GGGAAGAGAA GTTTTTGCGC TTCCGGGAAA TGTCAACAGT
ATTAAAAGTA CCGGAACCAA TAAATTAATA AAAGAAGGGG CAAAAATAGT AACAGGAATT
GACGATATAT TAGAAGAATT AAATATTTAT TTCATCGAAG AAAATACAAA AGTCTCCTTT
AACAAAAATC TTCAGGATGA AAGAATTTTA AGAGGCCTTG ACAATGATGA AAAGAAAGTT
GTGGAATGTT TGAAACTTGA GTCAATGCAT ATTGACAATA TTGCAAGAAA AACCGGATTT
GGCATACAGC TTGTTAATTC AATACTTGTA ATGCTCGAAT TAAAGGGAGT TGTTGAGCAG
CTTCCAGGAA AGATATTCAA GTTAAAGTTG TAA
 
Protein sequence
MESNLKEWVW LSSIPGIGAV KSRKLLEHFG DIHNVWSATA AELAVLPFLN RKDILNLTNV 
KFKQDVERHL ENIHKNDIKV ITLEDELYPA YLKNIYDPPL VLYMKGTIQE EEKYLAVVGS
RRATSYGLDM AKKISRELAE CGITVVSGMA RGVDSFAHMG ALEVKGRTIA VLGCGLDIVY
PYENKKLMEN IIESGACLSE YLPGTTPVPG NFPARNRIIS GISLGVVVIE AGERSGSLIT
ANFALEQGRE VFALPGNVNS IKSTGTNKLI KEGAKIVTGI DDILEELNIY FIEENTKVSF
NKNLQDERIL RGLDNDEKKV VECLKLESMH IDNIARKTGF GIQLVNSILV MLELKGVVEQ
LPGKIFKLKL