Gene Cthe_0794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0794 
Symbol 
ID4810412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp958188 
End bp959480 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content44% 
IMG OID640106211 
Productaluminium resistance protein 
Protein accessionYP_001037222 
Protein GI125973312 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4100] Cystathionine beta-lyase family protein involved in aluminum resistance 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTG AGTCAAAAAA TTATTTAAAG AATGAATTTG GAATTGATGA CAAGGTACTT 
GAAATTTGTG AATCGGTGAT GAGCAAGATA ACTCCCGTAT TTGACAGGAT TGATTCTGTC
CGGGAGTACA ACCAGTATAA AGTAATCAAG GCAATGCAGA ACAATAAATT GAGCGACTCC
CATTTCGTAG GCACTACAGG ATACGGCTAT GATGACAAGG GCAGAGACGT TTTGGATGAT
GTTTACAGGG ATATTTTCAA AGCAGAGGAT GCTTTGGTCA GACATCAAAT TGTATCCGGA
ACCCATGCTT TGGCCGTATG CCTTTATGGA CATCTCAGGC CGAAAGATGA GCTTTTGGCG
ATTACGGGAA AGCCTTATGA CACATTGGAA GAGGTTATTG GCTTAAGAGG AGAAGGGGGA
GGCTCTTTAA AGGAATTTGG CGTAACATAC CGCCAACTGG ATTTGTTGAA GGATGGCAGT
ATTGATTACG AGTCAATTGA AAGTGCGATA AATGAAAGAA CCGCAATGGT TCTGATTCAA
AGGTCAAGAG GATATGAATG GAGACCGGCT TTGTCCATAG ATGAGATTGA AAAGGCTATA
AATATTGTAA AAAGTATAAA AAAAGATATA GTGGTTCTTG TGGACAATTG CTACGGTGAG
TTTGTTGAAG AGAGGGAGCC CGTAGAGGTT GGAGCGGATC TTGTGGCTGG TTCTCTTATA
AAAAATCCCG GCGGTGGTCT TGCTCCTACG GGAGGATATG TTGCCGGAAG GAAAGAATGT
GTTGAAAAAG CGGCATACAG GCTTACAACT CCGGGACTTG GCAAACATGT GGGAGCATCT
TTGGGACATA ACAGGCTGAT GTTTCAGGGA TTGTTCATGG CGCCGCACGT GGTTGCCGAA
AGCCTTAAGG GAGCGGTATT TTGCGCTGGG GTTATGGAGG CGTTGGGTTT TGAGACAAGT
CCTAAAGTAA ACGACAGAAG GGGTGACATT ATTCAGGCCG TCAGGTTTAA TAACCCCGAA
AGTCTTATTG CTTTTTGCCA GGGAATCCAG AAGGGTTCGC CTGTGGATTC TTTTGTCACA
CCGGAGCCCT GGGACGTGCC CGGCTATGAT TGTCCTGTAA TAATGGCCGC CGGAGCTTTT
ATTCAAGGTT CGTCCATTGA ACTTAGTGCC GATGCGCCGA TTAAATCTCC ATATACTGCT
TATATGCAGG GAGGATTGGT TTTTGAACAT GTAAAGCTTG GAATTATGGT AGCCATACAA
AAAATGCTGG AGAAGGGAAT AATAAAAATC TAA
 
Protein sequence
MNFESKNYLK NEFGIDDKVL EICESVMSKI TPVFDRIDSV REYNQYKVIK AMQNNKLSDS 
HFVGTTGYGY DDKGRDVLDD VYRDIFKAED ALVRHQIVSG THALAVCLYG HLRPKDELLA
ITGKPYDTLE EVIGLRGEGG GSLKEFGVTY RQLDLLKDGS IDYESIESAI NERTAMVLIQ
RSRGYEWRPA LSIDEIEKAI NIVKSIKKDI VVLVDNCYGE FVEEREPVEV GADLVAGSLI
KNPGGGLAPT GGYVAGRKEC VEKAAYRLTT PGLGKHVGAS LGHNRLMFQG LFMAPHVVAE
SLKGAVFCAG VMEALGFETS PKVNDRRGDI IQAVRFNNPE SLIAFCQGIQ KGSPVDSFVT
PEPWDVPGYD CPVIMAAGAF IQGSSIELSA DAPIKSPYTA YMQGGLVFEH VKLGIMVAIQ
KMLEKGIIKI