Gene Cthe_2355 details


Gene Information

Locus tag: Cthe_2355
Symbol:
ID: 4808989
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2808454
End bp: 2810052
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 42%
IMG OID: 640107762
Product: L-aspartate oxidase
Protein accession: YP_001038750
Protein GI: 125974840
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0029] Aspartate oxidase
TIGRFAM ID: [TIGR00551] L-aspartate oxidase
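
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp form a 1-based, inclusive interval and that the gene length includes one terminal stop codon; both are assumptions that happen to match the listed numbers, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2808454
END_BP = 2810052
GENE_LENGTH_BP = 1599
PROTEIN_LENGTH_AA = 532


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Codons that encode amino acids, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2810052 - 2808454 + 1 gives 1599 bp, and 1599 / 3 - 1 gives 532 codons, matching the listed protein length.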


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.730681
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGATGAGA ACAGATATTT GGTGGATTTT GACACAGATG AGCTGCCTAC GGAGTTTCAT 
GATGTTATTA TAATAGGCAG CGGAATAGCA GGAGTTTACA CTGCGCTTGA AATACCTGAA
AAATACGATG TTGTCATACT CACAAAGGAA ACCATTGAAA TAAGCAACTC GGTTCTTGCC
CAGGGAGGAA TAGCCGTTTC CCTTGACAAA GGTGATTCTC CGGAGTTGCA TTTTAAGGAT
ACAATTTATG CCGGAGCAGG CCTGTGTGAC GAGGAAAGTG TGTGGGTTTT GGTTAACGAG
GCTGCGGCAA ATATTGAAAC TTTGTGCCAA TTTGGAGTCA ATTTCGACAG AAAAAGCAAT
GACGAGCTTG CCCTCTCGAG GGAAGGGGCT CACAGCAAAA ACAGGATTAT ACATGCCGGG
GATACAACAG GCAAGGAAGT TTGCGACAAG CTCATATCGG TGGTGAGAAC GAAACAGAAC
GTTAAAATAA AGGAAAGAGT TGCCGCGATA GATTTAATTA CCGAAGACAA TGTGTGCAAA
GGCATACTGG CCTATCATGA GGATAGTTCT TCCTATGTGT TTTACAGAGC AAATGTTGTG
GTGTGTGCAA CAGGAGGATA CGGGCAGTTG TACTCCAACA CAACAAATCC CGAAGTGGCA
ACGGGGGACG GGGCCGGTCT TGCCTACAGG GCCGGTGCGG AGCTCATGGA TTTGGAGTTT
GTGCAATTTC ATCCCACGGT TCTTTTCCAC CCCGAGAACA AAAGCTTTCT TATTTCCGAG
GCGGTAAGAG GGGAAGGTGC CATATTAAGA AATATTAAAG GCGAAAGGTT TATGCCCAAA
TATCATGAGC TTAAAGAGCT AGCACCCAGA GACATAGTTT CAAGATCCAT TTTTCATGAA
ATGCAAAAAA CAAATTCAAA CCATGTATAT CTGGATATCA CATTCAAAGG AAAGGAATAT
TTGGAAAACA GGTTTCCCAA TATTTACAAC ACATGCTTAA GTTACGGCAT AGATATGTCC
AAAGATTATA TTCCCGTTGC TCCGGCTGAA CATTACTGTA TGGGCGGAAT AAGGACGGAT
GTGTTTGGAC GCACAAATAT AAAAGGTTTC TATGCCTGCG GTGAAGTTGC ATGCAATGGA
ATACACGGTG CAAACAGGCT GGCCAGCAAT TCGCTTCTTG AAGGTTTGGT GTTTGGCCGC
AAGATAGGCA AAGAGGTGGA AAATGTAATT GAAGGCAGCC GAAAAGAGCC TCAAAAAGTC
AGTATCAAAG TGAAGTCAAA CAGGGTGGAA AAAAATATAG ATGTAAATAA AATTAAAAAG
GATATCCAGG AAACAATGAC CCGCTATGTT GGAATAGTAA GAGACAGGGA AGGACTTGAA
AAAGCAAAGA AAAAGGTTGA TGATTACTAC GAATTGATAA AAGATATGAA AAATAACAGC
GTAAGCGACT TTGAAATGCA AAACATTGTT CTTGTTTCAA AGCTTGTCAT TGAAGCGGCT
TTGGAACGCA AAGAAAGCCG TGGGGCGCAT TTTAGACTGG ATTATCAAAA AACTGACGAT
GAAAATTGGA AAAGAAACAT AATAAAAAGA AAAATTTAG
 
Protein sequence
MDENRYLVDF DTDELPTEFH DVIIIGSGIA GVYTALEIPE KYDVVILTKE TIEISNSVLA 
QGGIAVSLDK GDSPELHFKD TIYAGAGLCD EESVWVLVNE AAANIETLCQ FGVNFDRKSN
DELALSREGA HSKNRIIHAG DTTGKEVCDK LISVVRTKQN VKIKERVAAI DLITEDNVCK
GILAYHEDSS SYVFYRANVV VCATGGYGQL YSNTTNPEVA TGDGAGLAYR AGAELMDLEF
VQFHPTVLFH PENKSFLISE AVRGEGAILR NIKGERFMPK YHELKELAPR DIVSRSIFHE
MQKTNSNHVY LDITFKGKEY LENRFPNIYN TCLSYGIDMS KDYIPVAPAE HYCMGGIRTD
VFGRTNIKGF YACGEVACNG IHGANRLASN SLLEGLVFGR KIGKEVENVI EGSRKEPQKV
SIKVKSNRVE KNIDVNKIKK DIQETMTRYV GIVRDREGLE KAKKKVDDYY ELIKDMKNNS
VSDFEMQNIV LVSKLVIEAA LERKESRGAH FRLDYQKTDD ENWKRNIIKR KI
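
The gene and protein sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The following is a minimal sketch of that step plus a GC-fraction helper, shown on the first line of the gene sequence only. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the sketch does not attempt to reproduce how the listed 42% GC content was computed.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = (
    "TTGGATGAGA ACAGATATTT GGTGGATTTT GACACAGATG AGCTGCCTAC GGAGTTTCAT"
)
seq = join_blocks(first_line)

if __name__ == "__main__":
    print(len(seq))
    print(round(gc_fraction(seq), 3))
```

Passing the full multi-line gene sequence to `join_blocks` works the same way, because line breaks are treated like any other whitespace.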