Gene Athe_1598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1598 
Symbol 
ID7409428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1690566 
End bp1692314 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content33% 
IMG OID643715967 
Productprotein of unknown function DUF262 
Protein accessionYP_002573465 
Protein GI222529583 
COG category[S] Function unknown 
COG ID[COG3472] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTCC AGCAAGGACA GATGAACTTA TTGGACTTAG TTAGAAAGGC TTATAATGGT 
GAATGTATGC TTCCCGATTT TCAAAGAAAT TTTGTTTGGA CAAGATATGA CATTGAAGAG
CTTATAAAAT CACTTCTTCA AGGTATGTTT ATAGGAACTT TTTTGATTTT AGAAACAAAT
CCACAAAGTG TTCCTTTTAA GGTGATTTTT GTAGAAGGGG CAGAGAAAGT AAATCCCCAA
ATATGTGAGC AACCCAAGAT ATTAATTTTA GATGGACAAC AAAGACTTAC ATCTTTATTT
TACGCTATAT ATAGTCCTGA TATTCCTCTA CGCAATTCTG AGAATCCTTA TGCATTCTTT
ATTGATTTAG AAAAACTTGC TGAAGATAAC ATTGAGGACG CGGTATTCAG CTGGTCAAAA
AAATGGAGAG AGTTCAAGGA GATTATTGAT GAAAACGGAG ATTACAATCT TGAGGTTTTA
AAGGCGAAGA AAGTATTGCC ATTGACAGTG TTTAAAGATA TCCCTGAGTT CTATAGATTA
TGGTTTGGGG AGTATAAATT GTTATTTAAA GACCAGGAAG CAAATAAAAT ATTTGCTTAT
ATTGATAACA TGATAAAATA TAACATTTTT ACTCTATCGC TTGGTCTTTC GTATAATGAC
AAACCCGATG AAATTGCTGC TCTATTTGAA AAAATCAATA GAAGTGGTGT AAAACTTTCT
ATTTATGATC TTCTTGTAGC AAGATTTTAC AAGTTCATAA GACTTCGTGA AAAGTGGGAA
GAAGTATTTG AAAATAGTGT TAATATTAAA AAACTTGCAG GAAGAATAGA TAACACCACA
GTTCCCTATT CATTTATTCA AGCTCTTGCT TTAGCGGCGG ACAAAAATAT CAGTTCACGA
GAAATGCTAA AGATAGATAA TAACATTCTT TCAGACCAGA GCTGGGCCAA AGTGGTTGAT
ATTGCAGAAA ACAAGGTATT GCCTTATTTG CTTCAGATAA ATAACTTTGG CATTGTGGAT
TTTGAAAAGT GGCTACCTTA CTATCCCATT GTCACGATGA TGATTGCACT CTTTTTGAAA
TTTGAGCATC CTGACACAGA TAAAATTGAA AAATGGTACT GGAGTGCAGT TTTTTCTGAA
CGATATTCAG GTTCTACTGA AACTGCTATG GCAAAAGACT TCAAAGAAGT ATGTGTCTGG
TTTAATAATA ATAACTTTTT ACCTGAGGTT GTGGAAAAAT TAAGAAATCA ATTAGAGAGT
AATGTATATA CCTTGAAAGA GGTAAGGAGA AAGGGAAGTT CAAAATATAT CGGAATTTTC
AATCTTTTAT TTAAAAACGG AGCAAAGGAT TTTTATTATC CTGAGAACAT TGCTTTTAAC
CAGCTTGATG ACCATCATAT TTTTCCAGTG AGCTTTTTGA AAGTCAAAGG TGTGGAGGTT
GATGTTGACT CAATTATGAA CAGGACATTG ATTTTTGAAA ATACTAACAG AAGCATATCT
CGTCGCAGTC CCGGTGATTA CATAAGAAAG ATGATTGAAA TTCAAAAATC AAAAGGGCTC
TCAGAGCAAG AAGCAGAACA CAAGGTAAAA GAGATATTAA GGGGCCATTT CATTGATGAA
GAAATGTATA TATTATTGAA AAACACTACT GATAATCTGA CACCTTCTGA GATTAAAGAG
AATTTTGAAA GATTTATAAG TAAACGAGAA AAGTTAATTT TGAATGAGAT AAAAAGGCTG
ATATGGTAA
 
Protein sequence
MNLQQGQMNL LDLVRKAYNG ECMLPDFQRN FVWTRYDIEE LIKSLLQGMF IGTFLILETN 
PQSVPFKVIF VEGAEKVNPQ ICEQPKILIL DGQQRLTSLF YAIYSPDIPL RNSENPYAFF
IDLEKLAEDN IEDAVFSWSK KWREFKEIID ENGDYNLEVL KAKKVLPLTV FKDIPEFYRL
WFGEYKLLFK DQEANKIFAY IDNMIKYNIF TLSLGLSYND KPDEIAALFE KINRSGVKLS
IYDLLVARFY KFIRLREKWE EVFENSVNIK KLAGRIDNTT VPYSFIQALA LAADKNISSR
EMLKIDNNIL SDQSWAKVVD IAENKVLPYL LQINNFGIVD FEKWLPYYPI VTMMIALFLK
FEHPDTDKIE KWYWSAVFSE RYSGSTETAM AKDFKEVCVW FNNNNFLPEV VEKLRNQLES
NVYTLKEVRR KGSSKYIGIF NLLFKNGAKD FYYPENIAFN QLDDHHIFPV SFLKVKGVEV
DVDSIMNRTL IFENTNRSIS RRSPGDYIRK MIEIQKSKGL SEQEAEHKVK EILRGHFIDE
EMYILLKNTT DNLTPSEIKE NFERFISKRE KLILNEIKRL IW