Gene Cthe_1909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1909 
Symbol 
ID4810767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2270228 
End bp2271700 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content34% 
IMG OID640107326 
Productcopper amine oxidase-like protein 
Protein accessionYP_001038321 
Protein GI125974411 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0548415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAGA AGCTTTTTAA GTTGTTGACC TGTATTTGTA TAGTAATTAT AACGATTCAA 
ATAAATTTAA TATGTATTAA TGCTGGCACT AAAAAAATAG TAATTGAAGA TATAAAGGAT
ATTTCAAATG CACAGCAAAA AGTGATTGAC GGCTTGGTTT ATATTGAAGC AGGACCGGTA
CTGGAAGCAC TGGGATGGAA TTTAAAGTGG AAACCAACTG ACAAAGTGGT TGTTTGTACT
AAAGGTAATA ATTTGATGAT ACTTAAAGTG GGTAGTAATG TGGTTGTTTT GAACAATCAA
GAGATTTACC TTCAAGGCGA AGTGATTATT GACTCTGACA AAGTATTAAT TGAAAGCCAA
TTTGTAGCAC GCTATCTGGG TGAGCAGGTT ATATGGAACG GTGAGGATAA TGTTGTTATA
CAATCAGCTG ATTATAAAAC CAAAATTAGT TTAGATGGCA AGAGAAATAT TGTTATACTT
GGAAACGGTA TAATTGCAAA TATACATCAG CCTTGTCAAA CAGACACACT TTTTAGCATG
TTGGATCGTG CTGACAGCTT GCTGGAAAAT AATTATCCTG ATGAAGCACT TTCAATATAC
CAAGAAATGC TGTATGAGAT TTCCAAGGAC GAAAATCCCG ATATATATTT GAGAATTATG
AATAATATCG GTAATGCGTA TTATATGTTG GCAGACAGAA GAGATAGGGA GAAGAAGCTG
TTGTTGGCTG TAAGTTCTTA TGAAGAGGCT TTAGAGTGTT ATAATAAACA GGAAGTGGAA
TTTGATTATG CTACTATGTT ATGTAACCTT GGAAATGCAT ATAAAGGATT AGCTGATATT
TCCGGGAAGA AAGAATATTT ATTGAAATCC ATTAATTGTT ATGATATTGC TTTAGCACAG
TCCGAAGTAC CTTTTTTGGA TAAAGCTCTT CTCTACTATA ACATGGGTAT AGCGCGTGCA
GATAATGGCG AAGGAGGAAA AGCTGTATAC AATTTATTAA GAGCATGGTG TATTTATCAG
AAAGCGTTAA AGCTATATAC CATTGAAAGC AGACCTGATA TTTATGCCGA GATAAACTAC
AACCTGGCAA ATATCTACAA AATATTCTCT ACAATTGACC GAAGCGGAAC GTTTTACGAA
AAGTCAATAA CATCATATAA CAAAGCGTTG AAAGTATGGA CAGCAGAGAG CTATCCAATA
AATTATGCAA TGGTTTATAA ATGTATCGGT GATTTATGCA AGCAGCAGTA TGCAATAGAT
AATAATGTAC AAAATTTAGT AATGGCAGTT GAAGCGTATA ATGAAAGTCT GGAATTTTTC
CCGCTTATCA CATATCCTTT GAAATATTCA ATGATATATA TGGAGCAGGG CAATACCTAT
ATGCTTTTGA AAAAAGCAGA ATCGGATGAA CAATGGCTGA AAAAGGCTAT GTTTTGCTAT
AAACAGGCTT TAAAGCCATT TTCAGATGAA TAA
 
Protein sequence
MEKKLFKLLT CICIVIITIQ INLICINAGT KKIVIEDIKD ISNAQQKVID GLVYIEAGPV 
LEALGWNLKW KPTDKVVVCT KGNNLMILKV GSNVVVLNNQ EIYLQGEVII DSDKVLIESQ
FVARYLGEQV IWNGEDNVVI QSADYKTKIS LDGKRNIVIL GNGIIANIHQ PCQTDTLFSM
LDRADSLLEN NYPDEALSIY QEMLYEISKD ENPDIYLRIM NNIGNAYYML ADRRDREKKL
LLAVSSYEEA LECYNKQEVE FDYATMLCNL GNAYKGLADI SGKKEYLLKS INCYDIALAQ
SEVPFLDKAL LYYNMGIARA DNGEGGKAVY NLLRAWCIYQ KALKLYTIES RPDIYAEINY
NLANIYKIFS TIDRSGTFYE KSITSYNKAL KVWTAESYPI NYAMVYKCIG DLCKQQYAID
NNVQNLVMAV EAYNESLEFF PLITYPLKYS MIYMEQGNTY MLLKKAESDE QWLKKAMFCY
KQALKPFSDE