Gene Information |
Locus tag | Cthe_0726 |
Symbol | |
ID | 4810344 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 882024 |
End bp | 883526 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640106143 |
Product | putative aminopeptidase 1 |
Protein accession | YP_001037154 |
Protein GI | 125973244 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1362] Aspartyl aminopeptidase |
TIGRFAM ID | [TIGR01492] Plasmodium falciparum CPW-WPC domain |
|
|
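The coordinate and length fields in this record are mutually consistent, and the arithmetic can be sketched as below. This is only an illustrative check, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names `span_length` and `expected_protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 882024
end_bp = 883526
gene_length_bp = 1503
protein_length_aa = 500


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1503
print(expected_protein_length(gene_length_bp))  # 500
```

Both results match the Gene Length and Protein Length fields listed in the record.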
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTATTG TGAAAGCATT TAGAGATATA ATAATAAATA TAAAGTTTTT GCAGGAGGGA ATAAATATGT CAGACAATAA AACCGCGGGA CAAAAGTTGT GGGAAAACCT GTCCTACAAC TGCCCCAATG TTTGGAACAA AATAGGTAAT GATGAAATAA AAAACACCTT TGCATTTTGC GAAGAATACA AATCTTTTTT AGACAAATCC AAGACAGAGA GAGAGTTTGT GCGCGAAACT AAAAAGCTGG CTGAAAGCAA AGGTTTTGTT TCAATAGACG ATGTAATAGA TTCCGGAAAA CCGCTAAAAC CCGGAATGAA AGTCTACAGA GTATTCAAAA ACAAAATCGT GGCCCTTGCA GTTATAGGCT CACAGCCGCC GGAAAAAGGT TTTAACCTTG TAGGAGCGCA TATAGACTCA CCAAGAATTG ATCTTAAGCC AAACCCGATA TATGAGGCAA ATGAAATGGT ATTTCTGAAA ACACATTATT ACGGAGGTAT AAAAAAGTAT CAATGGGTAA CCATACCGCT CTCCATGCAT GGAGTGATCA TAAGAAAGGA TGGAAGCAGT GTCGAAATTG TAATCGGCGA AGATGAAAAT GATACAGTGT TTACAATAAC CGATTTGCTT CCGCACCTTG CGGCAGAACA AATGCAGAAA AAAATGAGCG AAGGAATTAC AGGAGAGAAT TTGAATATTC TTTTCGGTTC AATTCCCTAT AACGATGAAA AAGTTAAGGA AAAGGTAAAA TTAAACATAC TCAAGCTGCT TAATGAAAAA TACAATATAA CCGAAGAAGA TTTACTGTCC GCGGAATTGG AATTGGTTCC TTCTTTTAAG GCAAAAGATG TGGGTTTGGA TAAAAGTATG GTCGGTGCCT ACGGACAGGA TGACAGAGTT TGTGCTTACA CAGCTTTAAG AGCCGTCCTG GATTTGGATA ACGTCGACAA AACAGCTGTT TGTGTACTTA CGGACAAAGA GGAAATAGGA AGCATGGGTA ATACCGGAGC CCAGTCAAGC TTTCTTGAAA ACTTCATTGC CGATATATGC GCATTAAGCT CTGAAAAGTA CACGGACATA ATTTTAAGAA GATGCCTGAG CAATTCCAAA ATGCTTTCAG CCGACGTCAA CGCAGCAATA GATCCAACCT ATGAAGGAGT ATATGACAAA CTTAACTCTT CCTTCATCGG AAAAGGAATT GTTCTTTTAA AATATACCGG TGCCCGTGGA AAATCGGGTG CAAGTGATGC CCACGCTGAA TTTATGGGAG AAGTGAGAAA GCTGTTCAAT GAAAGAAAAA TATTCTGGCA AACCGCCGAA CTGGGCAAAG TTGATCAAGG TGGAGGAGGC ACAATTGCCC AATTTGTAGC CAATATGGGA ATGGACGTAA TAGACTGCGG AGTAGCTGTT CTCTCAATGC ATTCTCCGTT TGAAGTAACC AGCAAAGTGG ATATTTATAT GGCATATAAG GCTTACAGGG AGTTCTTGAA GTATATCAAA TAA
|
Protein sequence | MFIVKAFRDI IINIKFLQEG INMSDNKTAG QKLWENLSYN CPNVWNKIGN DEIKNTFAFC EEYKSFLDKS KTEREFVRET KKLAESKGFV SIDDVIDSGK PLKPGMKVYR VFKNKIVALA VIGSQPPEKG FNLVGAHIDS PRIDLKPNPI YEANEMVFLK THYYGGIKKY QWVTIPLSMH GVIIRKDGSS VEIVIGEDEN DTVFTITDLL PHLAAEQMQK KMSEGITGEN LNILFGSIPY NDEKVKEKVK LNILKLLNEK YNITEEDLLS AELELVPSFK AKDVGLDKSM VGAYGQDDRV CAYTALRAVL DLDNVDKTAV CVLTDKEEIG SMGNTGAQSS FLENFIADIC ALSSEKYTDI ILRRCLSNSK MLSADVNAAI DPTYEGVYDK LNSSFIGKGI VLLKYTGARG KSGASDAHAE FMGEVRKLFN ERKIFWQTAE LGKVDQGGGG TIAQFVANMG MDVIDCGVAV LSMHSPFEVT SKVDIYMAYK AYREFLKYIK
|
| |
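The sequences in this record are printed in space-separated blocks of ten characters, so they need to be joined before any computation such as GC content. The following is a minimal sketch of that clean-up, assuming the sequence contains only unambiguous bases; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example string is just the first three blocks of the gene sequence field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three ten-base blocks of the gene sequence field in this record.
example = "ATGTTTATTG TGAAAGCATT TAGAGATATA"
seq = clean_sequence(example)
print(len(seq))  # 30
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence field would give a value comparable to the GC content field of the record, subject to how the database rounds it.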