Gene Athe_0564 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0564 
Symbol 
ID7408690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp636627 
End bp637742 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content36% 
IMG OID643714947 
Productriboflavin biosynthesis protein RibD 
Protein accessionYP_002572463 
Protein GI222528581 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0117] Pyrimidine deaminase 
TIGRFAM ID[TIGR00227] riboflavin-specific deaminase C-terminal domain
[TIGR00326] riboflavin biosynthesis protein RibD 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAGCTC TTTCACATAG CTACTACATG AACATGGCAC TTGAGCTTGC AAAGAAAGCT 
TCTCCTTTGG TGCTGCCTAA CCCAAGAGTG GGATGTGTGA TTGTAAAAAA TGGAACTGTA
ATTGGAAAAG GGTATCACCA GAAGTATGGT GAAAAACACG CAGAAGTTTT GGCAATTGAG
GATGCAATAA AAAATGGATA CTTGCTCAAA AACGCAACAA TGTATGTTAC ATTAGAGCCA
TGCTGTCATT TTGGTAAACA GCCGCCTTGT ACAGATGCAA TTATAAAAAG TGGAATTAAA
AAGGTTGTGG TTGCCACCAA AGACCCAAAT CCTCTTGTCA ATGGCAAGGG CATACAGATA
TTAAAGCAGC ACGGAATAGA AGTTATAGAA GGTGTGCTGC AGAAAGAAGC AGAAAGTATT
AACAAGGATT TTTTTAAGTA CATGAAGACA GGCATTCCCT ATATCGCTAT AAAAGTTGCC
CAGAGTATTG ATGGCAAAAT TGCAACACCT TCAAATAAGA GGTTTTTGTT TAACACTGAA
GAGGAAAATG TCTTTGTGCA CAGTCTTCGT CAAAAATATA TGGCAATTTT GGTGTCAGTA
AATACTGTAA TTTCAGATAA CCCAATTTTA AATGCAAGGT ATGGCCAGAT TGTAAGGCAG
CCTACAAGAG TTGTGCTTGA TTCAAAGCTT CGAATTCCTT TAGAATGCAA TATTGTAAAA
ACTGCTGACA AGTATTCTAC CTACATTGTG TGCAGTGAAA ATGTAAATGA TATTCAAAAA
ATAGACCTTC TTTCTCAAAA AGGAATAAAA ATAATCTTTG CAAAGTCCTC AGAAGATGGT
CATCTTGACC TTTCAGATGC ATTTTCAAAA CTTGCACAGC AAAAGATAGT ATCAGTCCTT
GTTGAGGGAG GAAGTCTACT GAACTTTTTT CTTTTGAAAC AAAGAATTGC AGATTACTGG
TATTCATTGA TATTCAATGT TTTTATTGGT GGGCAGGACA CAAAAGGTGT AGTTGGTGCG
GAAGGCTTTG AGGACTTTTT CCCAAAGCTT GCAAATACAA AAGTCACAAC ATTTAAAAAT
TCTACTATAA TCGAAGGAGA TATAAGTTAT GTTTAG
 
Protein sequence
MRALSHSYYM NMALELAKKA SPLVLPNPRV GCVIVKNGTV IGKGYHQKYG EKHAEVLAIE 
DAIKNGYLLK NATMYVTLEP CCHFGKQPPC TDAIIKSGIK KVVVATKDPN PLVNGKGIQI
LKQHGIEVIE GVLQKEAESI NKDFFKYMKT GIPYIAIKVA QSIDGKIATP SNKRFLFNTE
EENVFVHSLR QKYMAILVSV NTVISDNPIL NARYGQIVRQ PTRVVLDSKL RIPLECNIVK
TADKYSTYIV CSENVNDIQK IDLLSQKGIK IIFAKSSEDG HLDLSDAFSK LAQQKIVSVL
VEGGSLLNFF LLKQRIADYW YSLIFNVFIG GQDTKGVVGA EGFEDFFPKL ANTKVTTFKN
STIIEGDISY V