Gene Cthe_1912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1912 
Symbol 
ID4810770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2277470 
End bp2279077 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content38% 
IMG OID640107329 
Productcopper amine oxidase-like protein 
Protein accessionYP_001038324 
Protein GI125974414 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000429846 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA TTGCAAGAAA AATATCAATG TTACTGGTCG TTGCACTTTT GGCGGTATCG 
ATGGTGGCCT GTACCAGTGA TGAAATTGCA TTAATTGAAG CCATGTCCAA GACATCAGAA
ATTTCATCCT ATGAAGGAAA TTCAAAAATT CAGTTAAGTT TTAAAGGCCA GGGATTTTCG
GAAAAAACGC AGAAAGTTTT CGATTTTCTG GCTTCATATG TTGACGGATT CACTTTTGAG
GCAAATCAGA AGTATTCGTC CAACGATGAA AAGACAAAAG CAACGCTTGC CATGGACGGA
AATGTGGATA TGCAAGGTTT GAGCGTTAAA TATAAATATT GGACCGATAT GGACTTTACA
ACCGAGAATC CTAGCTTAAT ACAGATTGTT GAACTTCCGC CGGCAATTAC CCAGCCGATG
TTTACCTTTG CCAACACAGG AACAAAAAAA TATATTACTA TTGATTACGG TAGTGTATTG
TCTGCGGAAA ATAACGGGGG CATATCTCTT AACCCGGAGA ATTTGGCAAA AAACAGTGTT
GAGCTGCAGG AAATGTTGCT GGACTTTGTA AAAACAACAG CCAAGGACTT TGACCCCGGT
ATGGTGGCAG TAACCAAAAA AGGTTCCGCT GTTACTGACA AGGGAGAAAA AGTTACGGAA
TACGAATTGA AATTGGATGA TGCTGCAGCT AAAAAATTAT TGCATGCTTT TATAAATGAT
GTCATATTGC AGGAAGATAC TATAGAGTTT GGCAAAAAGT ATATGGAAGC TGCCATAAAT
ATGTATGATT TCCCGGAAGA GGAAAAACAA GAGGCATTGG ACGAAATCAA CAAAGGTCTG
GATGAATTTG CATCCCAGCT TCCTGCATAC AGGGACAGTG TTACACAAGT TTTTGAGTCG
ATAAAAGACG TCAAATTCTT TGGTGACAAA GGTTTGGTAG CAAAATACTA CATAAACAAT
GACGGCTTCC TTGTAGGAGG AAAATCATCC ATTGACATTA AGATAAAAAT GGCAGATTTT
GCGGCATTAT TGGGTGACAA TTTTGACGAA AAAGATAAAA ATGGAGTGCT TTATTTAACA
ATAGACGCTG AAAGTTCTGT TTACAACATA AATAAAGAAG TATCCATAGA ACTTCCGGAA
ATAACTGAAG AAAATTCCTT TGATGTTTTG AAAGGATTTA TGCCGATACT GTCTGGTATT
GGTTCTGCTC CGGGTATCGG TGAAGAGGAT TATACATATG ATATTCCTGC TTTGTCAGAC
GGAATTAATG TTGTTATGAA TGGAAAAGTA GTATATTTCC CTGATGTAAA GCCTGAAAAC
GTCAATGGAA GAGTGTTGGT TCCAATAAGA ACCATATCCG AGGAAATGGG AGCGGAAGTA
ACCTATAATG ATGCAACAAA GCAGGTTCTT ATTGCCAAGG ATGATACGGA AATTCTTCTT
ACAATTGGTT CCCAGGAAGC TTACGTTAAC GGCGAGAAGA TAATGCTTGA TGTACCGGCA
ATGATTATTG AAGGACGTAC AATGGTTCCG TTAAGATTCA TATCTGAGAG TATGAATGCA
ACGGTTGAAT GGGACGGAGA AGCTCAGATA GTATACATAT TTTATTAA
 
Protein sequence
MKKIARKISM LLVVALLAVS MVACTSDEIA LIEAMSKTSE ISSYEGNSKI QLSFKGQGFS 
EKTQKVFDFL ASYVDGFTFE ANQKYSSNDE KTKATLAMDG NVDMQGLSVK YKYWTDMDFT
TENPSLIQIV ELPPAITQPM FTFANTGTKK YITIDYGSVL SAENNGGISL NPENLAKNSV
ELQEMLLDFV KTTAKDFDPG MVAVTKKGSA VTDKGEKVTE YELKLDDAAA KKLLHAFIND
VILQEDTIEF GKKYMEAAIN MYDFPEEEKQ EALDEINKGL DEFASQLPAY RDSVTQVFES
IKDVKFFGDK GLVAKYYINN DGFLVGGKSS IDIKIKMADF AALLGDNFDE KDKNGVLYLT
IDAESSVYNI NKEVSIELPE ITEENSFDVL KGFMPILSGI GSAPGIGEED YTYDIPALSD
GINVVMNGKV VYFPDVKPEN VNGRVLVPIR TISEEMGAEV TYNDATKQVL IAKDDTEILL
TIGSQEAYVN GEKIMLDVPA MIIEGRTMVP LRFISESMNA TVEWDGEAQI VYIFY