Gene Information |
Locus tag | Cthe_0794 |
Symbol | |
ID | 4810412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 958188 |
End bp | 959480 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106211 |
Product | aluminium resistance protein |
Protein accession | YP_001037222 |
Protein GI | 125973312 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4100] Cystathionine beta-lyase family protein involved in aluminum resistance |
TIGRFAM ID | |
|
|
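The coordinate and length fields in the record above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the Cthe_0794 record.
length = gene_length_bp(958188, 959480)
protein = expected_protein_length_aa(length)
print(length, protein)  # 1293 430
```

Both results match the Gene Length (1293 bp) and Protein Length (430 aa) rows.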
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTG AGTCAAAAAA TTATTTAAAG AATGAATTTG GAATTGATGA CAAGGTACTT GAAATTTGTG AATCGGTGAT GAGCAAGATA ACTCCCGTAT TTGACAGGAT TGATTCTGTC CGGGAGTACA ACCAGTATAA AGTAATCAAG GCAATGCAGA ACAATAAATT GAGCGACTCC CATTTCGTAG GCACTACAGG ATACGGCTAT GATGACAAGG GCAGAGACGT TTTGGATGAT GTTTACAGGG ATATTTTCAA AGCAGAGGAT GCTTTGGTCA GACATCAAAT TGTATCCGGA ACCCATGCTT TGGCCGTATG CCTTTATGGA CATCTCAGGC CGAAAGATGA GCTTTTGGCG ATTACGGGAA AGCCTTATGA CACATTGGAA GAGGTTATTG GCTTAAGAGG AGAAGGGGGA GGCTCTTTAA AGGAATTTGG CGTAACATAC CGCCAACTGG ATTTGTTGAA GGATGGCAGT ATTGATTACG AGTCAATTGA AAGTGCGATA AATGAAAGAA CCGCAATGGT TCTGATTCAA AGGTCAAGAG GATATGAATG GAGACCGGCT TTGTCCATAG ATGAGATTGA AAAGGCTATA AATATTGTAA AAAGTATAAA AAAAGATATA GTGGTTCTTG TGGACAATTG CTACGGTGAG TTTGTTGAAG AGAGGGAGCC CGTAGAGGTT GGAGCGGATC TTGTGGCTGG TTCTCTTATA AAAAATCCCG GCGGTGGTCT TGCTCCTACG GGAGGATATG TTGCCGGAAG GAAAGAATGT GTTGAAAAAG CGGCATACAG GCTTACAACT CCGGGACTTG GCAAACATGT GGGAGCATCT TTGGGACATA ACAGGCTGAT GTTTCAGGGA TTGTTCATGG CGCCGCACGT GGTTGCCGAA AGCCTTAAGG GAGCGGTATT TTGCGCTGGG GTTATGGAGG CGTTGGGTTT TGAGACAAGT CCTAAAGTAA ACGACAGAAG GGGTGACATT ATTCAGGCCG TCAGGTTTAA TAACCCCGAA AGTCTTATTG CTTTTTGCCA GGGAATCCAG AAGGGTTCGC CTGTGGATTC TTTTGTCACA CCGGAGCCCT GGGACGTGCC CGGCTATGAT TGTCCTGTAA TAATGGCCGC CGGAGCTTTT ATTCAAGGTT CGTCCATTGA ACTTAGTGCC GATGCGCCGA TTAAATCTCC ATATACTGCT TATATGCAGG GAGGATTGGT TTTTGAACAT GTAAAGCTTG GAATTATGGT AGCCATACAA AAAATGCTGG AGAAGGGAAT AATAAAAATC TAA
|
Protein sequence | MNFESKNYLK NEFGIDDKVL EICESVMSKI TPVFDRIDSV REYNQYKVIK AMQNNKLSDS HFVGTTGYGY DDKGRDVLDD VYRDIFKAED ALVRHQIVSG THALAVCLYG HLRPKDELLA ITGKPYDTLE EVIGLRGEGG GSLKEFGVTY RQLDLLKDGS IDYESIESAI NERTAMVLIQ RSRGYEWRPA LSIDEIEKAI NIVKSIKKDI VVLVDNCYGE FVEEREPVEV GADLVAGSLI KNPGGGLAPT GGYVAGRKEC VEKAAYRLTT PGLGKHVGAS LGHNRLMFQG LFMAPHVVAE SLKGAVFCAG VMEALGFETS PKVNDRRGDI IQAVRFNNPE SLIAFCQGIQ KGSPVDSFVT PEPWDVPGYD CPVIMAAGAF IQGSSIELSA DAPIKSPYTA YMQGGLVFEH VKLGIMVAIQ KMLEKGIIKI
|
| |
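The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The sketch below shows one way to strip the spacing, compute GC content, and translate the coding sequence. The helper names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons; alternative start codons are not handled here, which is harmless for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
fragment = clean_sequence("ATGAATTTTG AGTCAAAAAA TTATTTAAAG")
print(translate_cds(fragment))  # MNFESKNYLK
```

The translated fragment matches the first block of the protein sequence row. Passing the full gene sequence through `clean_sequence` and `gc_content` gives a value to compare against the GC content row.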