Gene Athe_1554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1554 
Symbol 
ID7409062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1643952 
End bp1645082 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content32% 
IMG OID643715926 
Productoxygen-independent coproporphyrinogen III oxidase 
Protein accessionYP_002573425 
Protein GI222529543 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0635] Coproporphyrinogen III oxidase and related Fe-S oxidoreductases 
TIGRFAM ID[TIGR00539] putative oxygen-independent coproporphyrinogen III oxidase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000769723 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGAAATA TAGGACTATA TATTCACGTT CCATTTTGTA AAAGAAAGTG TTATTACTGT 
GATTTTGTGT CATATGAAAA TGTCAATGAT GATGTGATTT TTGCATATTT TAGCGCGCTT
GAAAATGAAT TAATTTATTA TAAAGAAAAT TATGATATAG AGATAGACAC AATTTATTTA
GGGGGTGGTA CACCTTCTTC TATTTCTGCA AATTATATCT TGAAACTATT AGAATTTATA
TGTTCAAATT TCAAGATTAA AAGCAACTGC GAGATAACCA TCGAAGCAAA TCCTGAGAGT
ATCACACATA AGAAACTTCA AAGTTACAGC TTGGCAGGAG TAAACAGGCT AAGTATAGGG
ATACAGTCGC TAAACGATGT AGAACTGAGA GCAATTGGTA GAGTCCATGA CTCTGAGGTT
GCGCTTAAGA TTTTAAGTAG AGTACCTTTA TATTTTGAAA ATTTTAGCGT TGATGTTATC
ACAGGTCTTC CATATCAGAC TTTTAAAAGT TTTATGCAAA CACTCAATAC ACTTTTGGAA
TTTTCACCGC CTCATGTGTC AATTTATTCA CTGAAGATAG AAGAGGGGAC ATTTCTTTTT
GAAAGATATG AAGAGTATAA GAGCTTGTTG CCAAGCGAAG ATGAAGAAAG AAGAATGTTC
TGGTGGGCAA AAAGAATACT TTCTGAGATT GGAATTTATC ATTATGAGAT CTCTAATTTC
GCTAAGAAAG GTTTTGAGTG CAAACACAAC TTGAAATACT GGAATGTAGA GGAATACATT
GGAGTTGGCT GTGCAGCTCA TTCATTTTTT AATGGTTGCA GGTATTATAA TACTTCTAAT
ATAAACGAAT ATGTTAAGAA AATAAAAGAA AATGGTTTAG CTATTGAAGA AAAAGAAATT
ATTTCATGTG AAGAAAGGGA AAAGGAATTT ATTATATTAG GGCTGAGAAA GATAGAAGGA
TTATCCCTTG AGGAATTTAG AAAAAGATTT GGCGTAGAGT TTGAAAGAAA GTACAACTCT
CAAATTGAGA AACTGAAAAA ATATGGATTG ATAGAGGTAA GTAACAATTA TTTGAGACTT
ACAGAGAGAG GAATTGACCT TGCAAATTTA GTGTGGATGG AGTTTGTTTA A
 
Protein sequence
MRNIGLYIHV PFCKRKCYYC DFVSYENVND DVIFAYFSAL ENELIYYKEN YDIEIDTIYL 
GGGTPSSISA NYILKLLEFI CSNFKIKSNC EITIEANPES ITHKKLQSYS LAGVNRLSIG
IQSLNDVELR AIGRVHDSEV ALKILSRVPL YFENFSVDVI TGLPYQTFKS FMQTLNTLLE
FSPPHVSIYS LKIEEGTFLF ERYEEYKSLL PSEDEERRMF WWAKRILSEI GIYHYEISNF
AKKGFECKHN LKYWNVEEYI GVGCAAHSFF NGCRYYNTSN INEYVKKIKE NGLAIEEKEI
ISCEEREKEF IILGLRKIEG LSLEEFRKRF GVEFERKYNS QIEKLKKYGL IEVSNNYLRL
TERGIDLANL VWMEFV