Gene Cthe_0316 details

Gene Information

Locus tag: Cthe_0316
Symbol:
ID: 4808534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 398665
End bp: 400599
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 42%
IMG OID: 640105727
Product: PA14
Protein accession: YP_001036747
Protein GI: 125972837
COG category:
COG ID:
TIGRFAM ID:
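
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 398665
END_BP = 400599
GENE_LENGTH_BP = 1935
PROTEIN_LENGTH_AA = 644


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codon count, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1935
print(coding_codons(GENE_LENGTH_BP))   # 644
```

Under those assumptions both results agree with the listed gene length (1935 bp) and protein length (644 aa).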


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.914268
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATAACA TAGGAGTAAT CATTAAGATA GAAGGAAACG AAGCCATTGT AATGACCGAC 
GATTGCTCTT TCAAAAAGGT TCCGATAAAA GATGGAATGC ATCCGGGGCA AAAAATACTT
GTGCCCAATA ATGAAGTTAT ACAGAAGGAA AATAAAAGCA TAAAGCGGAT TTCGGCTGTC
GCGACCGGCA TTGCAGCCGT GTTTTTGATG GTGTTGTCGT TAATATGGAT TAACAAACCG
GGCAGACCGG ATGGTATATA TGCATATATT GACGTTGATA TAAATCCCAG TTTAAACTTC
CTGATTGACC GGGAGGGAAA GGTAAAGGCG TTAAACCCGT TAAATGATGA TGCGCAGGAA
ATAATCCGTG GTGTTGAGTT TGAGGATATG TTTTTTTCAG AAGCCCTTAC GCAGATTATC
AAGATATCAA AAGCCAAAGG TATTATAGAT GAAAACAAAA CCAATTATGT ACTGATTTGT
GCAGCTTTGG ACGATAATTA CAATTTGCAA AGCGACGACA AATCCCGGGC GCAAACAGAG
TTTGAAGAGT TTTTGGACGG TATTAGGGAA AGTATAGAGA AAGCCTGCGG CAATACGGTA
ATTCCTCAAA CGGTAAAAGT ACCGTTTGAA TACTTAAAAA TGGCAAAGCA AAATGATGTA
TCCATGGGAA GGTATCTGGT TTATCAGAAG TTGGAGGACA TTGGAGTGAA TTTGTCGATA
GAAGAGCTGA AATCATTGGA TATCGATGAA ATATTAAAAA AATATGGTGT GGGTTTTGAT
GAATTGTTCA AAAGTGAGTA TACGGAATTG CCGTATGGGA CTTTGCAAAC AGGAGAAGAT
TCTGTTGTGT CTACAGAGGA TGTGCCGGTA TCGCCGAAAA ATGCATTTGA AACGATGGCT
GTGCCGACAA ATACGCCTTC AATATCGACT AAACCTTCAG CAACCCCGGC GGAGAATCCG
ACGCCAAAAT TAACGCAGAA ACCAACGCCT GTACCGGCAA AAACAGGTGA ACGTACAAGC
ACAACGCCGA CACCGACACC GGCGCCAACC GTCAGAAACG GTACCGGCAG CGGACTTAGG
GGAGAGTATT ACAATAATAT GGATTTTTCC CGTTTCCAGT TTGTGAGAAT TGATCCCTGT
ATAGACTTTG ACTGGGGTGA AGGCACACCG GATCAATCCA TCGGAAAGGA TACCTATTCT
GTCAGATGGA CAGGGAAGGT TGAACCTAGA TATTCGGAAA CATACACATT TTATACTGTT
ACCGATGACG GTGTGAGATT GTGGGTAGAC GGAGTGCTGC TCATTGACAA GTGGAAGAGC
CAGTCGGCTA CTGAACACAG CGAGCAAATT TATCTCGAGG CCGGAAAGAA ATATGATATT
AAAATGGAGT ATTACCAGCA TGTCCGGGCT GCTTCGGCAA AACTTATGTG GTCAAGCAAG
AGCCAGCAAA AGGAGATAAT ACCTTCAAGT CAACTGTATC CTTCCGACGG CCCGCTGCCT
CAGAAGGATG TAAACGGTTT GAGTGCGGAA TATTACGGGG ATGCGGAGTT GAAAGACAAG
AGATTTACCA GAATAGACGA TGCTATAAAC TTTAACTGGG ATAAGGATTT TCCGGTTGGT
GAATTGAAAG ACGGAAAGTT TTCGGTAAGA TGGGTGGGAA AAATAGACAC CAGATATACC
GAAGAGTATA CGTTCCATAC TGTTGCAAAC GGAGGAGTAA GGGTATGGAT AAATAATGTG
TTGATAATTG ACAATTGGCA AAATCAGGGC AAAGAAGCTG AAAACAGCGG AAAAATTGAA
TTAAAGGCAG GAAGGCAGTA TGATATTAAA GTTGAGTATT GCAACTACGG AGAACCTGCA
TTCATAAAGC TTTTATGGTC CAGTCAAAGA CAGAAAAAAG AGGTGGTTCC TTCAAAAAAT
TTGTTTGCAG ATTAA
 
Protein sequence
MDNIGVIIKI EGNEAIVMTD DCSFKKVPIK DGMHPGQKIL VPNNEVIQKE NKSIKRISAV 
ATGIAAVFLM VLSLIWINKP GRPDGIYAYI DVDINPSLNF LIDREGKVKA LNPLNDDAQE
IIRGVEFEDM FFSEALTQII KISKAKGIID ENKTNYVLIC AALDDNYNLQ SDDKSRAQTE
FEEFLDGIRE SIEKACGNTV IPQTVKVPFE YLKMAKQNDV SMGRYLVYQK LEDIGVNLSI
EELKSLDIDE ILKKYGVGFD ELFKSEYTEL PYGTLQTGED SVVSTEDVPV SPKNAFETMA
VPTNTPSIST KPSATPAENP TPKLTQKPTP VPAKTGERTS TTPTPTPAPT VRNGTGSGLR
GEYYNNMDFS RFQFVRIDPC IDFDWGEGTP DQSIGKDTYS VRWTGKVEPR YSETYTFYTV
TDDGVRLWVD GVLLIDKWKS QSATEHSEQI YLEAGKKYDI KMEYYQHVRA ASAKLMWSSK
SQQKEIIPSS QLYPSDGPLP QKDVNGLSAE YYGDAELKDK RFTRIDDAIN FNWDKDFPVG
ELKDGKFSVR WVGKIDTRYT EEYTFHTVAN GGVRVWINNV LIIDNWQNQG KEAENSGKIE
LKAGRQYDIK VEYCNYGEPA FIKLLWSSQR QKKEVVPSKN LFAD
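
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a hedged example, not the database's own pipeline: it builds the codon table by hand, assuming that translation table 11 assigns the same amino acids to internal codons as the standard code (the tables differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same amino-acid assignments as the standard code for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First and last display lines of the gene sequence above.
print(translate("ATGGATAACA TAGGAGTAAT CATTAAGATA"))  # MDNIGVIIKI
print(translate("TTGTTTGCAG ATTAA"))                  # LFAD
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.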