Gene Athe_0137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0137 
Symbol 
ID7408499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp171716 
End bp173989 
Gene Length2274 bp 
Protein Length757 aa 
Translation table11 
GC content31% 
IMG OID643714542 
Productprotein of unknown function DUF324 
Protein accessionYP_002572065 
Protein GI222528183 
COG category[L] Replication, recombination and repair 
COG ID[COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) 
TIGRFAM ID[TIGR01877] CRISPR-associated endoribonuclease Cas6 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAGTA TAAAAATTTT TCTAAAAACT TTGTCATATA CTTTAGTTGG TTGTGGGGAA 
AGCAAAGGAA TGGTAGACAA CGATATAGTT TACGATAGTT TAGGTCTTCC TTATATACCC
TCAAGGCGCA TAAAAGGGCT TTTCAAAGAA AGCGCTACAG AAGTGTGTGA AATGCTGGGA
ATATCAATTG AGTTGGTGGA TTCTTTATTT GGTAGAGATG GCTTTAACCC TGCCAAAATT
TATATAGACA ACTTATATGT GACTAATTAT AAGGAGATAA AAAGGGAAAT TGAAAAACTC
AAGGAAGAAG AATATTATAA ACATTTTCTT TATCCAGAGA AAATAATTTC TTGTTATACA
GTTGAGCGCT ACCAAACAGC AATTGATGCT GAAAATGGAA CAGCAAAAGA AAATTCTTTA
AGAACATCAA GAGTTTTGAA GCCGAACATA GAATTTGAAG GTGCAATATT TGAATTAAAA
CCTCTTTCAG AAAAAGAAAA AGCTTTGCTT TATCTTGCTT CTATAAATTT GAGAAGAATT
GGCACGTTAA GAAACAGAGG GTTTGGTCAA GTAAGATGTT GGATAGAAGG TATTGATTTC
AAAAATGTTG TTGAAGCAAT AGAAAAGTTA AAATGTGAAG AAGAGAGTTT TTCAGAGGTA
AATAAGTCAA TAAAAGAATT CAAGTGTACG GGGACAGAGG ATAGAACATT AGCAAAGCTT
GTCTATACAA TAAAGACATT GTCACCAATA GTGATAGCAA GTCCAAGAGG GGAACAAAAC
ACTGTCTACA CTAAAACATA CATCCCGGCT CTCACTGTTA AGGGATTAAT AGTTAATCAA
TTTATAAGAC TTATGGATTT GGGAGAAAAT GCCCATCAAA ATGATTATTT TTACAAGATG
TTTCTAAAAG GAGAAATAAT TTTTACTCCA GCTTATCCTG CTAAAGAGAA TAGAATATTT
GAACCAATTC CACTTTGTCT TCAACCTGAA AAAGGGAGCG AACCTGATGT TTTGTATAAT
CTATTTGACC CTACAAATGA ATCAAGAAAT ACCAAACCAA TGAAAAAATT TTGGTACTTT
ACAGAAGATA AAGTAGATAA TGGAAATTGT GTATATGAGT TGTATGAACC AGAGACAATT
TTCTATTTTC ACAATTCAAG GGATAGACAA AAAGGACATA GTGTAGGTGA GGGTATTTTC
TATTATGAAG CAATAGATTC TGAGCAAGAA TTCAAAGGTG AGATAGTAGG ACCCAGGAAT
TATCTTTTGC AACTCAAGGA ACTAATTGGT TCATTTGAAG GATTTATAGG CAGGTCCAAA
ACTGCTCAGT ATGGTTTGGT AAAGTTTGAT TTTGGAGAAA TTAAAGATAT AGAAACTCAG
GAAGATATTG ATGACGAATA TATCATTTAT GCTTTGTCTC CTATAATAGT TTACAATTGC
TATGGTTTTA CAGAGCCTTC GGAAAGGGTT TTAAAATCTT ACTTGGCAAA AATTTTAGAA
TGTAAAGAAG AAGATATAGA AATCATGAGC TCTGCTGCAA AGGTTGAACG ATTAGAAAAT
TTTGTTGGAG TATGGAAAAT GAAAAGCAGT TCTGAAATAG CTTATGCTGC GGGTTCTTGT
TTTAGAATTA AATTAAGATG TCAAATTAGG GATTTTAAAG ATAAAATAAA TGAAATTCTT
ATTAATGGTA TAGGTGAGAA GACGCAAAAT GGTTATGGTA GGGTAAAAAT TTATTTTGAT
TTAGCTAAGA AATATCAAAA AAGAGATTCT AAAGAGGAAC ATAAAGAAAA AGTTATAATT
AATAAAAGTG TTAATCTCAT TGAGGATATA GTGAAGTCAG ATTTATTTAT GTGGGCTAAA
ATTAAAGGCT TTGAAAAAGC CAGAGAGTTT GACAAGAAAA AAGAAATTTC TAGCCATTTA
ATAGGCAGGT TAGAGAATTT ATTGTCAGAT TCTAAAAATT TAGACACATG GAAAAAAGGT
CTTTCAAAAT TTAGCGACAA ACAAGCAGGA AAAAAATTAA AAAAAGTTAG ATTGTGGGAT
GAATTGTTTG ATGAAAATAG CAAAGTGGAT ATCACTGTTA AATTATATAA TTCTCTTGCT
GAAGAAAGGG ACTTTAATAA ATATTTTCAT AAAAGTATTT TCGGAAATCT AAAGTGGAAT
ATAGAGCAAG ATACAGATTT TCTTTGGGAA TTTTTCAAAG CATATTGGCT TTCTTTTCTG
CGATATTTAA GGTTATTTAA AAAACGGGAG GAAGAAGTTA ATGTGCAAAA TTAA
 
Protein sequence
MNSIKIFLKT LSYTLVGCGE SKGMVDNDIV YDSLGLPYIP SRRIKGLFKE SATEVCEMLG 
ISIELVDSLF GRDGFNPAKI YIDNLYVTNY KEIKREIEKL KEEEYYKHFL YPEKIISCYT
VERYQTAIDA ENGTAKENSL RTSRVLKPNI EFEGAIFELK PLSEKEKALL YLASINLRRI
GTLRNRGFGQ VRCWIEGIDF KNVVEAIEKL KCEEESFSEV NKSIKEFKCT GTEDRTLAKL
VYTIKTLSPI VIASPRGEQN TVYTKTYIPA LTVKGLIVNQ FIRLMDLGEN AHQNDYFYKM
FLKGEIIFTP AYPAKENRIF EPIPLCLQPE KGSEPDVLYN LFDPTNESRN TKPMKKFWYF
TEDKVDNGNC VYELYEPETI FYFHNSRDRQ KGHSVGEGIF YYEAIDSEQE FKGEIVGPRN
YLLQLKELIG SFEGFIGRSK TAQYGLVKFD FGEIKDIETQ EDIDDEYIIY ALSPIIVYNC
YGFTEPSERV LKSYLAKILE CKEEDIEIMS SAAKVERLEN FVGVWKMKSS SEIAYAAGSC
FRIKLRCQIR DFKDKINEIL INGIGEKTQN GYGRVKIYFD LAKKYQKRDS KEEHKEKVII
NKSVNLIEDI VKSDLFMWAK IKGFEKAREF DKKKEISSHL IGRLENLLSD SKNLDTWKKG
LSKFSDKQAG KKLKKVRLWD ELFDENSKVD ITVKLYNSLA EERDFNKYFH KSIFGNLKWN
IEQDTDFLWE FFKAYWLSFL RYLRLFKKRE EEVNVQN