Gene Athe_2474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2474 
Symbol 
ID7409343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2612945 
End bp2613874 
Gene Length930 bp 
Protein Length309 aa 
Translation table11 
GC content38% 
IMG OID643716837 
Producthypothetical protein 
Protein accessionYP_002574315 
Protein GI222530433 
COG category[S] Function unknown 
COG ID[COG5464] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01784] conserved hypothetical protein (putative transposase or invertase) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000139417 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGATA GTTTGCCACC ACAGGAGCAT GATTCAACTT TTAAGTTTTT GTTTGAAAAT 
GCAAAAGATA TCCTTTTCCT TGTCAGGGAC GTAATAGGCT ACAGCTGGGC AAAAGATATT
CAAGAAGACT CAATAGAACT TGCGAACAAA GAATTTGTAG ATGAAGACTT TCTGCAAAGA
AGAGCAGACG TCATAGCAAA AGCAAAACTA AAAGACAGGG AAGTATACTT CTACATTATC
ATCGAAAATC AATCGAGAGT CGATGGGAAT ATGCCAAAAA GACTTTTGGA GTACATGATT
TTGCTATGGG CAAAGAAAAT CAGAGAAGGT GTAAAGAAAC TTCCGGCGAT AATTCCAATA
GTAACATACA ACGGTCTTGA TAAGGACTGG GATATACCAC AGGAAATAAT CAGCGAATTT
GATATTTTCA AAGACGATAT TTTCAGGTAC GCTCTTGTAA ACATTTCAAA ATTAGATGCA
AAGGCTCTGT TGCAAGAGGA AGAGGATGTC TTGAGCCCGG TAGTGTTCTA CTTAGAACAA
GTGCGAGATG ATACAGAAAA GTTAATTGAG AGGCTAAAAG AGCTTGTACC AAAACTGCAA
AACTTCAGTC AAACCAATAT GGAGAGGTTT TTAACATGGG CGGGAAATGT AATACGTCCG
AGGTTTCCAA AAGAGGAAAG GGAGAAGTAT GATAAGCTTG ACCAGGAGCT AAAGCAGGGG
GGAGTGGCGA AAATGGGTGA GTTTGTATCT AATGTTGCAA AACTACTGGA TGAAGCACAG
ATGAAAAAGT ACAACGAAGG CGTTATTAAA ACAAGAATAG AAATAGCAAG GAACATGATA
AAAGAAGGGG CAGAGGACAT CTTTATAGCA AAGGTGACAG GACTTACAAT TGAAGAAGTG
AGAAAACTCA GAGACGAAAC TCTATCATAA
 
Protein sequence
MRDSLPPQEH DSTFKFLFEN AKDILFLVRD VIGYSWAKDI QEDSIELANK EFVDEDFLQR 
RADVIAKAKL KDREVYFYII IENQSRVDGN MPKRLLEYMI LLWAKKIREG VKKLPAIIPI
VTYNGLDKDW DIPQEIISEF DIFKDDIFRY ALVNISKLDA KALLQEEEDV LSPVVFYLEQ
VRDDTEKLIE RLKELVPKLQ NFSQTNMERF LTWAGNVIRP RFPKEEREKY DKLDQELKQG
GVAKMGEFVS NVAKLLDEAQ MKKYNEGVIK TRIEIARNMI KEGAEDIFIA KVTGLTIEEV
RKLRDETLS