Gene Information |
Locus tag | Cthe_2214 |
Symbol | |
ID | 4811079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2642982 |
End bp | 2643923 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640107620 |
Product | hypothetical protein |
Protein accession | YP_001038609 |
Protein GI | 125974699 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
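The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length (the gene sequence in this record does end in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is an untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2642982, 2643923)
print(length, protein_length_aa(length))  # prints: 942 313
```

The result agrees with the Gene Length (942 bp) and Protein Length (313 aa) fields.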
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTA CTGTTTTGAC CTATTCACGC CAAAAAAGTC ATATAATAAG AAAAATCATA TTGTTTATTG TTTTGCTCGC GTTGATTTTT TCTGTTGTGG TTTCGGCTGT GTCGGTTATT GCGGGATGGA AACTGATTCA TCCTAAAAGA TTGAATATTT TGGACTTTTC CGCCAATATC GTACCTTCTT ATACTGATGT TTCTTTCAAA GACATAAACG ATGAGTTTGA ATTAAAAGGA TGGTACTTTA ATGTAACCGG AAGCAGCAAG ACAGTAATCC TCGCCCATGG ATACGGAAAA AACAGGCTGA ATTTTGGCGA GAACACCATA CATCTCATAA AGAGCCTTCT TGACAAAGGA TACAACGTAC TTGCCTTTGA TTTCAGAAAC TCCGGAGAAT CCGAGGGAAA TAAAACTACT TTCGGAGTTT GTGAAAAGAA TGATTTGCTT GGTGCAATTC AATACGTTAA AAACAAAGGT TCGGAGAAAA TAGTTCTCAT GGGTTTCTCA ACAGGAGCAT CGGCATGTAT TCTCGCAGCT GCCGAAAGCG ACGATGTTGA CGCAGTAATA GCAGAAAGCC CATACTCTGA CCTTAACACT TATTTTGAAC AAAACGTAAA CAATTTAACC AACTTTCCGG CAATACCGTT TAACAAAACA ATAACATTTG CAACATTTTT TCTGGCAGAT ATCAAGCCCG ACGAAGCCAG CCCTGTAAAA GCCGTACAGG CTGTTTCTCC GCGCCCTGTG CTTCTAATCC ACAGCAAGGA TGACACCAAA GTGCCGGTTG AAAACAGCCG CCTGATATAC AAGGCATCCA ATCCTTACAC CACCACGTTT TGGGAAACAA GCGGTGCAGA TCATGAGGAA ATTTATCAGG CAAATCCCGA AGAATACGTA AAAAAGGTAA CGGACTTTCT TGAAAAACTG TCACAGACTT AA
Protein sequence | MRITVLTYSR QKSHIIRKII LFIVLLALIF SVVVSAVSVI AGWKLIHPKR LNILDFSANI VPSYTDVSFK DINDEFELKG WYFNVTGSSK TVILAHGYGK NRLNFGENTI HLIKSLLDKG YNVLAFDFRN SGESEGNKTT FGVCEKNDLL GAIQYVKNKG SEKIVLMGFS TGASACILAA AESDDVDAVI AESPYSDLNT YFEQNVNNLT NFPAIPFNKT ITFATFFLAD IKPDEASPVK AVQAVSPRPV LLIHSKDDTK VPVENSRLIY KASNPYTTTF WETSGADHEE IYQANPEEYV KKVTDFLEKL SQT
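Both sequences are printed in space-separated blocks of ten characters, so the spaces need to be stripped before any downstream use. The sketch below, with hypothetical helper names, removes the block spacing and computes a GC fraction. It is demonstrated only on the first two blocks of the gene sequence; running it over the full gene sequence is one way to check the 40% GC content reported above.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring block spacing."""
    cleaned = strip_blocks(seq)
    if not cleaned:
        raise ValueError("empty sequence")
    return (cleaned.count("G") + cleaned.count("C")) / len(cleaned)


# First two display blocks of the gene sequence in this record.
sample = "ATGAGAATTA CTGTTTTGAC"
print(len(strip_blocks(sample)), gc_fraction(sample))  # prints: 20 0.3
```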