Gene Athe_0321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0321 
Symbol 
ID7407638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp365630 
End bp366946 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content40% 
IMG OID643714711 
Productprotein of unknown function UPF0052 and CofD 
Protein accessionYP_002572234 
Protein GI222528352 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATTTG ATTTTCTAAA AACAGGTATT TATATAAAAA GATGGATTGT TTTGATGTTA 
TTAAGTGTTA TATTAATTTC TGCCTCTTTG ACGAAAGATA TTGACATTTT TAGTATTTCA
GTGGGACTTC GCACCTCTTT GCGCATAGGA TTATTTGTTT TAGGAATTGG AGTTTTTTTG
ATTTCGCTGA TGGGTTTAAT AAAGGGTTTT ATAAGGCTTT TGAACAAAAG TTTGCCATAC
AGAATAAGTC AGAAGACCAT ATTTGATGCT ATATATTCAA AAGGGTTTTT GGAAAAAGGG
CCAAGGATTG TTGCAATCGG TGGTGGTACA GGACTTTCTA CAATGCTCAG GGGTATAAAG
AATCTCACAG CAAACATCAC AGCAGTTGTA ACTGTTGCAG ACGATGGAGG GGGGTCAGGA
AAGCTCAGAG AAGACCTTGG CATGCTTCCA CCGGGTGATA TCAGAAACTG CATACTGGCC
TTAGCAAACA CAGAAGAGAT TATGCAAAAG CTTTTGAATT ACAGGTTCAA GGAAGGAAGC
CTTAAAGGTC AGAGTTTTGG CAACCTGTTC TTAGCTGCAA TGACAGGAAT TGCAGGTAGC
TTTGAAAAAG CGGTAAAACT CATGAGCGAG GTTTTAGCAG TTCGAGGAAA AGTTCTGCCT
GTGACTCTTG ACAATATTAA TCTTTGTGCT GAGCTTGAAG ACGGCAGGGT AGTTGTTGGC
GAGTCAAAAA TTCCTGAAGA GGTGAAAAAT TCTAAGACCC CAATAAAAAG AGTTTTTATA
ACTCCATCTG ATGCAAAGCC ATATCCTGAA GTTCTGGATG AGATTGAAAA GGCGGATGTG
ATAATAATTG GACCGGGGAG CCTTTATACA AGCATTATGC CAAATCTTGT GTTCAAGGAG
GTTGTTGAGA GTATAAAGAA AAGCAGAGCC AAAAAGATTT ACGTTGCAAA TATTATGACA
CAGCCTGGTG AAACTGACGG ATACCTCTTG TGTGACCATA TAGAAGCAAT AGAGAGACAT
TGCGGCGGAA GGATTTTTGA CATTGTTATT GTGAACAACC AGCCAATTCC TCCAGATGTG
CTGGAGAGAT ACAGGGAAGA TGGTGCACAG CCTGTCTGGA GTGACAAGAA AACAGTTGAG
AAGGGATATA CAGTGGTAGA AGAAGGACTT TTGAGCATTT CTAACGGACT TATTCGTCAC
AACTCAGCAA AACTTGCAAG GGTTGTGAGC AACATCGTCT ATGGCAAGGA TGTGATCCAT
GAGTACAGAT TAGGGTATCT GAGATTGAAA AATTTTAGTA AACCGTTTAA GAGCTGA
 
Protein sequence
MIFDFLKTGI YIKRWIVLML LSVILISASL TKDIDIFSIS VGLRTSLRIG LFVLGIGVFL 
ISLMGLIKGF IRLLNKSLPY RISQKTIFDA IYSKGFLEKG PRIVAIGGGT GLSTMLRGIK
NLTANITAVV TVADDGGGSG KLREDLGMLP PGDIRNCILA LANTEEIMQK LLNYRFKEGS
LKGQSFGNLF LAAMTGIAGS FEKAVKLMSE VLAVRGKVLP VTLDNINLCA ELEDGRVVVG
ESKIPEEVKN SKTPIKRVFI TPSDAKPYPE VLDEIEKADV IIIGPGSLYT SIMPNLVFKE
VVESIKKSRA KKIYVANIMT QPGETDGYLL CDHIEAIERH CGGRIFDIVI VNNQPIPPDV
LERYREDGAQ PVWSDKKTVE KGYTVVEEGL LSISNGLIRH NSAKLARVVS NIVYGKDVIH
EYRLGYLRLK NFSKPFKS