Gene Athe_1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1197 
Symbol 
ID7409671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1289114 
End bp1290331 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content39% 
IMG OID643715562 
Producttransposase, IS605 OrfB family 
Protein accessionYP_002573070 
Protein GI222529188 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAAAA CACAGAAAAA TCATATAAGA TGTGATAAAC AAACATACAG ACTGCTACGT 
CAGCTTTGCC ACTTCTCAAA AAACTTGTAT AACTATGCTC TGTACCATAT AAGGCAACAC
TACTTTAAAA CTCAGGAATA TTTGAGATAT GAGAGTGTAT ATCATCTTGT GAAAGGCAAC
GAAAACTACA GACTTTTGCC ATCACAGGTT GCCCAGCAAA CACTTATCTC AGTGGATGAA
GCTTTTAAAT CTTTTCTAAG TCTTTTGAAA GCAAAAAAAG AAGGGAAGGT AAAAGAAAAA
GTTTCAATGC CCAAATACCT GCCGAAGGAT GGGATGTATC AGATAGTTTT TCCGAAAGAT
CAGTTCAAGA CAGAAGGAAA AAAAGTACGA TTGAGTTTTG GCAGGGGTTT TGCAAAAGAG
TTTGGGGTAA AGTACCTGTA TTTTGACCTG CCAGCTACGG TTTTAGGTAA GAAAATTAGG
GAAGTCAGGA TAGTGCCAAG GCTGGGTGGT AGATGGTTTG AGATTGAGTA TGTGTATGAA
GATAAGCAGC AGCCAAGAAT TTTTGATTTG AGCAGGTATT TAGGGGTAGA CTTAGGGCTT
GACAATTTTG CAGCGGTGGT TGATACCATC GGGACTGCCT TTTTGATAGA AGGCAGGTTT
TTAAAATCAG TCAACCGATG GTACAACAAA GAAAGGGCAA GACTTCAGTC AATCTACAGC
AAGCAGGGGA TAAAATGTGG CAGAAGACTT GCTCAGATTT CTCTTGAAAG GCAGCATGTA
ATTGACAACT TTTTGAATCA AGCCGTAAGT TTGATAATCA AGCACTGTTT GAACAATCAG
ATAGGTGCAG TAGTTGTTGG CAGGATGAAA GGTATAAAGC AGGGGATAGA GCTTGGTGTT
GTAAACAATC AAAATTTTGT GGGGATACCA TATGACAAGT TCAAGAGGAA GCTAAAATCA
AAGTGCATGT ATTATGGTAT AAGGTATATG GAAGTGGATG AGGGTTATAC ATCACAAAGA
TGTAGCAGAT GTGGGCATGT TAGCAAAAGT AGCAGGAGAT ACAGGGGATT GTATGTATGC
AGAAAGTGTG GGTATGTAAT AAATGCGGAT ATAAATGGAG CGATAAACAT AGTTGCAAAG
GTAGCTGGTG AGTCTGTGGT GAAGCAGATA ACCAGTAGTG GGTGTGTGAA CCACCCTGTG
AGAATAAGGG TAGCTTGA
 
Protein sequence
MYKTQKNHIR CDKQTYRLLR QLCHFSKNLY NYALYHIRQH YFKTQEYLRY ESVYHLVKGN 
ENYRLLPSQV AQQTLISVDE AFKSFLSLLK AKKEGKVKEK VSMPKYLPKD GMYQIVFPKD
QFKTEGKKVR LSFGRGFAKE FGVKYLYFDL PATVLGKKIR EVRIVPRLGG RWFEIEYVYE
DKQQPRIFDL SRYLGVDLGL DNFAAVVDTI GTAFLIEGRF LKSVNRWYNK ERARLQSIYS
KQGIKCGRRL AQISLERQHV IDNFLNQAVS LIIKHCLNNQ IGAVVVGRMK GIKQGIELGV
VNNQNFVGIP YDKFKRKLKS KCMYYGIRYM EVDEGYTSQR CSRCGHVSKS SRRYRGLYVC
RKCGYVINAD INGAINIVAK VAGESVVKQI TSSGCVNHPV RIRVA