Gene Athe_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2026 
Symbol 
ID7408238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2136913 
End bp2138091 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content34% 
IMG OID643716392 
Productintegrase family protein 
Protein accessionYP_002573876 
Protein GI222529994 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAGGC GCGGTAAAGG TGAAGGTAGT ATTTTCAAAA GAAAAGATGG AAGATGGTGT 
GGCTTTATTA CTGTTGGCTA TGATGAAAAA GGAAATCAAA AAAAGAAATT CTTTTACGGC
AAAACAAGGC AGGAAGTTGC TGAAAAAATA AATCAAGCAC TAAATGAAAT TAAACAAGGA
ATTTTAATAA CTGACAATAA TATTACACTT GAAAATTGGC TTAACATCTG GCTGCATCAG
TACAAAAAAA ATCAAATTAG TGAATCAACT TTTGATGATT ATGAAAGCAT AATAAAAAAT
CACATAAATC CTGTACTTGA AAAATATAAC CTCAAAGATT TGCGTCCAGA ACATCTACAA
ATGCTTTACA ATGAAAAACA TAAAGCAGGT CTTTCGACAA AAAGAATCAA GGATATTCAT
GTCATCCTAC ATTCAGCTTT AAATCAAGCA ATTAAAAACG GACTCATTGT ACGAAATGTC
AGTGAAGCAA CCACCTTACC AAAAAACACC AGAGAAAAGG AAATGAAAGT TTTGACAATA
GAAGAACAGA AAAGATTTCT GCAGGTACTT GAAGGTGAAA GATTGAAACC TGCCTTTGTT
CTTGCCTTGA GTACTGGAAT GCGACTGGGA GAAATTTTGG CTTTGAAGTG GCAAGATGTC
GATTTAGAAA ACAAAAGAAT TACTATTAGA AATTCTGTCC GCAGGATAAA AAACAGGAAT
GAGCAGTCAG AAATTAAAAC TAAAACTGTT CTTGTTCTTA AAGAACCTAA AACCGAAAAT
TCTGGAAGAA TAATTCCACT GCCAGATGTT GCCTATCAAG AACTTGTTAA TTTCAAACTA
TTGCAGGAAG AAGAAAAAAG ACAAGCAGGT AGTAGCTACG TAGATAGTGG TTTTGTCTTT
ACAACCAAAG TTGGAACACC TATTGAGCCA AGAAACTTCC TGCGGACATT TTACCGTATT
ACAGAAAAAG CAGGACTTAA TATTAACTTC CATGCTCTAA GACACACATT TGCAACAAGA
CTTTTAGAAG CAAACACTAA CCCTAAAGTT GTTCAAGAGC TGCTGGGACA CAGTGATATA
TCAACCACAT TGAATATTTA TTCGCATGTA TTGTTTGACA CAAAACAGAA AGCTATTGGG
GAAATTAATG ATTTAATGAA AAATCTTACC AATGAATGA
 
Protein sequence
MGRRGKGEGS IFKRKDGRWC GFITVGYDEK GNQKKKFFYG KTRQEVAEKI NQALNEIKQG 
ILITDNNITL ENWLNIWLHQ YKKNQISEST FDDYESIIKN HINPVLEKYN LKDLRPEHLQ
MLYNEKHKAG LSTKRIKDIH VILHSALNQA IKNGLIVRNV SEATTLPKNT REKEMKVLTI
EEQKRFLQVL EGERLKPAFV LALSTGMRLG EILALKWQDV DLENKRITIR NSVRRIKNRN
EQSEIKTKTV LVLKEPKTEN SGRIIPLPDV AYQELVNFKL LQEEEKRQAG SSYVDSGFVF
TTKVGTPIEP RNFLRTFYRI TEKAGLNINF HALRHTFATR LLEANTNPKV VQELLGHSDI
STTLNIYSHV LFDTKQKAIG EINDLMKNLT NE