Gene Cthe_0726 details

Gene Information

Locus tag: Cthe_0726
Symbol:
ID: 4810344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 882024
End bp: 883526
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 39%
IMG OID: 640106143
Product: putative aminopeptidase 1
Protein accession: YP_001037154
Protein GI: 125973244
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1362] Aspartyl aminopeptidase
TIGRFAM ID: [TIGR01492] Plasmodium falciparum CPW-WPC domain
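
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the figures in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Cthe_0726.
gene_length = feature_length(882024, 883526)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1503 500
```

Both results agree with the Gene Length and Protein Length fields.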


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTATTG TGAAAGCATT TAGAGATATA ATAATAAATA TAAAGTTTTT GCAGGAGGGA 
ATAAATATGT CAGACAATAA AACCGCGGGA CAAAAGTTGT GGGAAAACCT GTCCTACAAC
TGCCCCAATG TTTGGAACAA AATAGGTAAT GATGAAATAA AAAACACCTT TGCATTTTGC
GAAGAATACA AATCTTTTTT AGACAAATCC AAGACAGAGA GAGAGTTTGT GCGCGAAACT
AAAAAGCTGG CTGAAAGCAA AGGTTTTGTT TCAATAGACG ATGTAATAGA TTCCGGAAAA
CCGCTAAAAC CCGGAATGAA AGTCTACAGA GTATTCAAAA ACAAAATCGT GGCCCTTGCA
GTTATAGGCT CACAGCCGCC GGAAAAAGGT TTTAACCTTG TAGGAGCGCA TATAGACTCA
CCAAGAATTG ATCTTAAGCC AAACCCGATA TATGAGGCAA ATGAAATGGT ATTTCTGAAA
ACACATTATT ACGGAGGTAT AAAAAAGTAT CAATGGGTAA CCATACCGCT CTCCATGCAT
GGAGTGATCA TAAGAAAGGA TGGAAGCAGT GTCGAAATTG TAATCGGCGA AGATGAAAAT
GATACAGTGT TTACAATAAC CGATTTGCTT CCGCACCTTG CGGCAGAACA AATGCAGAAA
AAAATGAGCG AAGGAATTAC AGGAGAGAAT TTGAATATTC TTTTCGGTTC AATTCCCTAT
AACGATGAAA AAGTTAAGGA AAAGGTAAAA TTAAACATAC TCAAGCTGCT TAATGAAAAA
TACAATATAA CCGAAGAAGA TTTACTGTCC GCGGAATTGG AATTGGTTCC TTCTTTTAAG
GCAAAAGATG TGGGTTTGGA TAAAAGTATG GTCGGTGCCT ACGGACAGGA TGACAGAGTT
TGTGCTTACA CAGCTTTAAG AGCCGTCCTG GATTTGGATA ACGTCGACAA AACAGCTGTT
TGTGTACTTA CGGACAAAGA GGAAATAGGA AGCATGGGTA ATACCGGAGC CCAGTCAAGC
TTTCTTGAAA ACTTCATTGC CGATATATGC GCATTAAGCT CTGAAAAGTA CACGGACATA
ATTTTAAGAA GATGCCTGAG CAATTCCAAA ATGCTTTCAG CCGACGTCAA CGCAGCAATA
GATCCAACCT ATGAAGGAGT ATATGACAAA CTTAACTCTT CCTTCATCGG AAAAGGAATT
GTTCTTTTAA AATATACCGG TGCCCGTGGA AAATCGGGTG CAAGTGATGC CCACGCTGAA
TTTATGGGAG AAGTGAGAAA GCTGTTCAAT GAAAGAAAAA TATTCTGGCA AACCGCCGAA
CTGGGCAAAG TTGATCAAGG TGGAGGAGGC ACAATTGCCC AATTTGTAGC CAATATGGGA
ATGGACGTAA TAGACTGCGG AGTAGCTGTT CTCTCAATGC ATTCTCCGTT TGAAGTAACC
AGCAAAGTGG ATATTTATAT GGCATATAAG GCTTACAGGG AGTTCTTGAA GTATATCAAA
TAA
 
Protein sequence
MFIVKAFRDI IINIKFLQEG INMSDNKTAG QKLWENLSYN CPNVWNKIGN DEIKNTFAFC 
EEYKSFLDKS KTEREFVRET KKLAESKGFV SIDDVIDSGK PLKPGMKVYR VFKNKIVALA
VIGSQPPEKG FNLVGAHIDS PRIDLKPNPI YEANEMVFLK THYYGGIKKY QWVTIPLSMH
GVIIRKDGSS VEIVIGEDEN DTVFTITDLL PHLAAEQMQK KMSEGITGEN LNILFGSIPY
NDEKVKEKVK LNILKLLNEK YNITEEDLLS AELELVPSFK AKDVGLDKSM VGAYGQDDRV
CAYTALRAVL DLDNVDKTAV CVLTDKEEIG SMGNTGAQSS FLENFIADIC ALSSEKYTDI
ILRRCLSNSK MLSADVNAAI DPTYEGVYDK LNSSFIGKGI VLLKYTGARG KSGASDAHAE
FMGEVRKLFN ERKIFWQTAE LGKVDQGGGG TIAQFVANMG MDVIDCGVAV LSMHSPFEVT
SKVDIYMAYK AYREFLKYIK
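
The relationship between the two sequences above can be illustrated with a short translation routine. This is a minimal sketch, not the database's own method: it builds the codon table in the NCBI TCAG ordering (translation table 11 uses the same codon assignments as the standard code), strips the spacing used in the 10-character display groups, and stops at the first stop codon. It does not handle the alternative start codons that table 11 allows; this gene starts with ATG, so that does not matter here. The names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence from this record.
first_line = "ATGTTTATTG TGAAAGCATT TAGAGATATA ATAATAAATA TAAAGTTTTT GCAGGAGGGA"
print(translate(first_line))  # MFIVKAFRDIIINIKFLQEG
```

The 20 residues printed match the first two groups of the protein sequence; passing the full gene sequence to `translate` would be the way to check the remaining residues.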