Gene Cthe_0824 details


Gene Information

Locus tag: Cthe_0824
Symbol:
ID: 4810442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1002382
End bp: 1004157
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 38%
IMG OID: 640106241
Product: copper amine oxidase-like protein
Protein accession: YP_001037252
Protein GI: 125973342
COG category:
COG ID:
TIGRFAM ID:
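
The coordinates and lengths listed for this gene can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information record for Cthe_0824.
start_bp = 1002382
end_bp = 1004157
gene_length_bp = 1776
protein_length_aa = 591

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons,
# and the final codon is a stop codon that is not translated.
codons = span // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)  # 1776 592 591
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.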


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.195798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAA AGCTGTTATT CATGGCGCTG TGCATCCTTG TTTTTTACTT TCCGTTGATT 
TCCCATGCCA GTGCAAACAT CACGGTTGTG GTAAACGGCG AAAAAGTTAA CTTTACTGAC
CAACAACCTT TCATTGACAG CAACTCAAGA ACAATGGTAC CCATAAGATT CATTTCCGAA
GCCCTCGACG CCAAAGTGGA TTGGATTGAA AAAGAGCGCA TGGTGGTAAT AAAAAAGCAG
GGAACGGAAA TTTCACTGGT TGTAGGTATG AAAACCGCCA AAGTCAATGG TAAAGAAATC
AAGCTTGATA CTTCATCGGT TATTGCAGGA GGCAGGACTT TTGTCCCTGT AAGATTTATC
TCAGAAGCCT TTGGTGCTAC CGTAGAATGG GACGGTAAAA ACAAAATTGT CACAATCACA
ACCAAGCCGC AAGCCGGAAA TGAAATCGGT AAAATGCCCG AATCAAGCTA TTACAACGAA
CTTTACAACC TCTTCACCCT CGTGCCCAAA GGTATGAAAG GCTCCAATAT AGAGCAGTTC
GCTTATTACA CCTTTAAAGA AGATGGTACT GTCCTCAACC TCAACTTTAC CGAAGAGGAA
ATATTCAAAG GTCATTCATT CCCCGAGTTG TCAAAAGGCG GGGTTATCTT TTCCCCCAGC
AACCTTGAAA CCGTTGAATC ATACACTGTT GAAATCTTCA AAAATGTGTC AAGGACAAAC
AAACAGCTTG TCGGTCAATA CATATATTTT TCCGATTTAA AACCTGAAAA CCTTATGGTT
ATTGAAAAAA ACGGGTTTAC TTTTGTCACC ACAAAGGGAT TTAAACCTTT TGAAATCAAA
GAAACCGGCT ACAAGCTGAT GAGTAAAAAA CTCTTTATAT ACAACGACAA AAACCTGTAT
GTTTTTTCAT TTATTTATGA CCCCTCGGAC AGAAAGTATC TTACGGAAAG TGTTATGGAC
AGCATCATTA ATTCCATTGA AATAAACGGT ACCAAAATCA ATCTGAAAAA ATCAACAACA
CCGGGCAAAA ACGGCGAAGT TGAAGATATT TTACCGGCAG AAGCCAATTA TACCAGGAAA
ATGACTGTAA AAGAAAATGA GGAACGGCTT AAAAAGAAAT ATTTTTCTCA AGGATTTGAC
CTTTCGGACA TGGTCAGAAT TGAAGGAGGC ACCTTTATCG ACGAAAACGG TGAAAAAGTA
ACCGTAAAGC CCTTCTACGT CAGCAAAAAC TTAGTGACAA TCAGCGAATG GAACAGCATC
TCAAAAAATA AAATCAACAT CAAAGACCTC AACGCAAAAT ACAATTTAAA CATAAAATCC
GAAAATTATC CGGCCGTATT TGAAACCGAA GAAAAATACG GCACGGTCAA AATAAAGAAC
AATATGGAGC TTTACGTGTT CTGCAATGAA AAAAGCAAAA GTTTCGGAAT CGAAGAATAT
TATACCTTCC AAAAAACCAG TTACGGTACT TCTACCATCT TTGAACACAA CGGAGGATTC
AGGTTGCTGA GTGAAGATGA ACTAAAATAC ATATTGAGAA AAACCAAATC CGATGCCTAT
AAAAACAACG TAAAATCAAA TACCGTTTCC GAAGTCGGAC GTTCTTCCAA AAACGAGTTT
GGAATTTTTG ACTACGACTC TAATGTTGCG GAAGTAACGG ATTTTGATTC GGTACTCAAA
ATCAAAGACA ATTCTCAAAT GCTGTGCGGT TTCAGATATG CAAAAGATAT TGGAAATTCA
CCACAAGATT TGATACTGGA TTTTTTCAGT AATTAA
 
Protein sequence
MAKKLLFMAL CILVFYFPLI SHASANITVV VNGEKVNFTD QQPFIDSNSR TMVPIRFISE 
ALDAKVDWIE KERMVVIKKQ GTEISLVVGM KTAKVNGKEI KLDTSSVIAG GRTFVPVRFI
SEAFGATVEW DGKNKIVTIT TKPQAGNEIG KMPESSYYNE LYNLFTLVPK GMKGSNIEQF
AYYTFKEDGT VLNLNFTEEE IFKGHSFPEL SKGGVIFSPS NLETVESYTV EIFKNVSRTN
KQLVGQYIYF SDLKPENLMV IEKNGFTFVT TKGFKPFEIK ETGYKLMSKK LFIYNDKNLY
VFSFIYDPSD RKYLTESVMD SIINSIEING TKINLKKSTT PGKNGEVEDI LPAEANYTRK
MTVKENEERL KKKYFSQGFD LSDMVRIEGG TFIDENGEKV TVKPFYVSKN LVTISEWNSI
SKNKINIKDL NAKYNLNIKS ENYPAVFETE EKYGTVKIKN NMELYVFCNE KSKSFGIEEY
YTFQKTSYGT STIFEHNGGF RLLSEDELKY ILRKTKSDAY KNNVKSNTVS EVGRSSKNEF
GIFDYDSNVA EVTDFDSVLK IKDNSQMLCG FRYAKDIGNS PQDLILDFFS N
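
The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The following is a minimal sketch, not the tool that produced this record. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (table 11 differs in which codons may act as start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first row of the gene sequence is used as input here.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order (standard code layout).
# Assumption: table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence shown in this record.
first_row = "ATGGCAAAAA AGCTGTTATT CATGGCGCTG TGCATCCTTG TTTTTTACTT TCCGTTGATT"
print(translate(first_row))  # MAKKLLFMALCILVFYFPLI
```

The printed peptide corresponds to the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the listed GC content to be compared in the same way.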