Gene Athe_2311 details

Gene Information

Locus tag: Athe_2311
Symbol:
ID: 7407730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2449219
End bp: 2450226
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 36%
IMG OID: 643716675
Product: transcriptional regulator, LacI family
Protein accession: YP_002574154
Protein GI: 222530272
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
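
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2449219, 2450226)
print(length)                     # 1008
print(protein_length_aa(length))  # 335
```

Both values agree with the Gene Length and Protein Length fields of this record.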


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000000106518
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTACA CAATAAAAGA CATAGCAAAG ATAACTGGCT ATTCTGTTGC GACAATTTCA 
AGGGCGCTGA GCGGTAAAGA AGGAGTAAGC GAGGAAAAAA GAAAAGAAAT CTTAAGAATT
ATAGACCAAC TTGGATATAT TCCAAACCAG AGTGCAAGGA GGCTAAGGAG CAAACAAACA
AAAAATATTC TTGTGATGAT ACCAGACATA GAAAATTACT TTTTTAATAA GCTAATAAAA
GGTATTGAAG CAGAGGCGCG TGAAAAAGGT TATAATATCA TCCTTGGCGA TTTTTCAGAC
TCTCAGGAAA TTGAAGAAGA GTACTACAAG ATGATGAAAG GGCAGATAGC AGATGGGATA
TTGATTGTTG GAAGTTTGAG TGGGCCTCAA AAGGTTATTG AAATGTCCAG ACAGTTTCCA
ATGGTTGTAA TTTCTGACTA TTTTTCAGAC GAGCTTGTGA CGGTGTGTAT TGACAATTTC
AAAGCTGCAT ATGATGCTAC CATGTTTTTG TACAAATGTG GATATCGAAG AATTGCTAAA
ATAACAGGGA AAATTGGATC GATACTATCT CAAGACAGAT TGAAAGGATA TAGGATGGCA
CTTGAGAACT TAGGGTTAGA TAGAGATGAA AAATATATAA AATATGGTGA TTTCAAGTAC
GAAAGTGGCT ACAGGCTTGC AATAGAACTC TTAAATGCAA AACCTTGTCC GGATGCAATA
TTTTGCTCAA ACGATGAAAT GGCAATTGGT GCATGTGATG CAGCAAAAGA TCTGGGTTTT
TCCATTCCCG ATGAACTTGG GATTATGGGA TTTGATAATA TTGAACTTTC ATCAATAGTA
ACACCGAAAA TCACCACAGT TCATCAGCCC CGATACGAAA TGGGGAGGCT GGCTGCACAA
CTTTTGATAA AAAAATTAAC AGGAGAAAAA GTTTCAAAAG GCAAATATAT CCTTGACACT
TCAATAATTC CAAGAGATTC GACCAAAAAT GCAAATATCA CAAAATGA
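
The GC content field of this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence and that the spaces and line breaks in the listing are formatting only. The helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence as listed, spaces included.
first_line = "ATGAAGTACA CAATAAAAGA CATAGCAAAG ATAACTGGCT ATTCTGTTGC GACAATTTCA"
print(round(gc_percent(first_line), 1))
```

Passing the whole gene sequence, rather than one line, gives a value to compare with the rounded percentage reported in the record.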
 
Protein sequence
MKYTIKDIAK ITGYSVATIS RALSGKEGVS EEKRKEILRI IDQLGYIPNQ SARRLRSKQT 
KNILVMIPDI ENYFFNKLIK GIEAEAREKG YNIILGDFSD SQEIEEEYYK MMKGQIADGI
LIVGSLSGPQ KVIEMSRQFP MVVISDYFSD ELVTVCIDNF KAAYDATMFL YKCGYRRIAK
ITGKIGSILS QDRLKGYRMA LENLGLDRDE KYIKYGDFKY ESGYRLAIEL LNAKPCPDAI
FCSNDEMAIG ACDAAKDLGF SIPDELGIMG FDNIELSSIV TPKITTVHQP RYEMGRLAAQ
LLIKKLTGEK VSKGKYILDT SIIPRDSTKN ANITK
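
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last codons of the record. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons), and it does not depend on any external library. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Amino-acid string in NCBI order: codon bases iterate over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAGTACA CAATAAAAGA CATAGCAAAG"))  # MKYTIKDIAK
print(translate("GCAAATATCA CAAAATGA"))               # ANITK
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.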