Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3028 |
Symbol | |
ID | 4811100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 3551120 |
End bp | 3552316 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640108449 |
Product | histidine decarboxylase |
Protein accession | YP_001039417 |
Protein GI | 125975507 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0076] Glutamate decarboxylase and related PLP-dependent proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.726503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAACAC AGCAAAATTT ATTTATTAAT TCAATATTTG ATACTTCTGT TACTGCAGAT TTGAAATCAA ATAGTGAAAG TTTACTTGAA GAAATCGATA AAATTTACGA AGATTTATAC GAAAAGCACC AGCATTATCT TGGATATCCA TTCAATTTAA ATCTGGAATA CGCTGAGTTT GGCAAATTTC TAAATTTACA GCCCAACAAT CTCGGCGATG CATTCTATTC TTCAACAGTT AACATTGATA CAAAAAAGCA AGAACGGGAA GTCTTAAAAT TTTTTGCAGA TGTATACAAA CTTCCCTGGG AAGAAGCCTG GGGTTACATA GGACACGGCG GCACCGAAGG AAACCTCTGT GGAATGTTGG TTGCCCGCGA ACGTTATCCT GATGGAATAT TCTATTTCTC TGAAGCCAGT CATTACAGTA TAAAAAAGAA TGCCTGGATA TTGGGCAAGC CGGGTGAGGT GATTCCATCC CAGCCCAATG GAGAATTTGA TTATAATGCA TTAATTGAGC GAATACTTAA AAACGGCAAC AAACCGGTTT TACTTGTCGC AACATTGGGA ACAACCATGA CCGGTGCCAT TGATAATGTC CAAATTATTG TTGACTTGTT CAAAAAGCAC AATATCAAAG AGTATCATAT TCATTATGAC GGTGCGCTTT TCGGCGGAAT GATACCGTTT ATGGAAAATG GACCGGAACT TAATTTTGAA ACCCTGCCCA TCGATTCTAT CGCAATCAGT GGTCATAAAT TTGTAGGTTG TCCCATGCCG GCAGGTATTT TCCTCACACG TAAAAAATAC ATTCAAAAAA TCCTTGAAAA TTCAGATGTA TCTTACGTAG GCACCAAAGA CACTACTATA AGCGGTTGTC GCAACGGGCT TTCAGCATTG CTTCTTTGGT ATCAGATAAA CCGCAAAGGT GTGGAAGGAT TCAAACAGGA TGTAAGACAA TGCATGGAAG TGACCGCATA TGCCAAGGCC AGATTGGATT CTATCGGCTG GAATAACTTT GTGAATCCAT GGTCAAACAC TATTGTAATA GACAAACCAA ACGATGCAAT ATGCAATTAC TGGTCTTTGG CCTGCGAAGG AGATAAAGCA CACATAATAA TCATGCAGCA TGTTACAAAA GAACATATTG ATTTGTTCAT CGAACATTTA CTAAACAGCA AATATACCAT AAATTAA
|
Protein sequence | MSTQQNLFIN SIFDTSVTAD LKSNSESLLE EIDKIYEDLY EKHQHYLGYP FNLNLEYAEF GKFLNLQPNN LGDAFYSSTV NIDTKKQERE VLKFFADVYK LPWEEAWGYI GHGGTEGNLC GMLVARERYP DGIFYFSEAS HYSIKKNAWI LGKPGEVIPS QPNGEFDYNA LIERILKNGN KPVLLVATLG TTMTGAIDNV QIIVDLFKKH NIKEYHIHYD GALFGGMIPF MENGPELNFE TLPIDSIAIS GHKFVGCPMP AGIFLTRKKY IQKILENSDV SYVGTKDTTI SGCRNGLSAL LLWYQINRKG VEGFKQDVRQ CMEVTAYAKA RLDSIGWNNF VNPWSNTIVI DKPNDAICNY WSLACEGDKA HIIIMQHVTK EHIDLFIEHL LNSKYTIN
|
| |