Gene Athe_1029 details

Gene Information

Locus tag: Athe_1029
Symbol:
ID: 7409585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1122771
End bp: 1123973
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 36%
IMG OID: 643715394
Product: integrase family protein
Protein accession: YP_002572903
Protein GI: 222529021
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTACCA AAACAAAAAA GAGAGGGAAT AATGAAGGCA GCATATACAA AAGAAAAGAT 
GGGCTCTGGT GCGGTCAAAT CACCATAGGA AGAGATGAAA ACGGCAGACA AAAGCGGCAG
TATTTCTATG GCAAGACAAG ACAAGAGGTT GCTGAAAAGA TAGCAAAGAC ACTAAATGAC
TTAGCCAATG GAGTATATGT TGACCCTGCA AAGACAACGT TGAAAGATTG GCTTAACACA
TGGCTTTGGG AATATAAAAA GCAGACATTG CGACCTTCAA CGTTCAAAGA TTATTTGTGC
TATATAGAAA GACATATAAA TCCTGCCATT GGTCATTATA AGTTAAAAGA TTTGAGACCT
GAACATCTTC AAGCTTTGTA TAATGCCAAA TATCAAGAAG GTTTGAGTAT AAGTACAATT
AAGCAGATTC ATACTGTTTT GCATTCAGCT TTAGACCAAG CTTTAAAGAA TGGACTTGTC
AACAGAAATG TTTCAGAGGC AACCACTTTA CCAAAAGGCA AGCCAAAAAG AGAGATAAGG
ATACTAAGTT TAGAAGAACA ACAAAGATTT ATTGCAGCTT TGGAAGGGGA AAGATTAAAA
ACTGCGTTTC TTGTTGAGTT AGCAAGTGGA CTTAGAATTG GTGAACTTTT AGCTTTGCGC
TGGAAAGATG TCAATTTCAA GGATGGATAC ATTGAAGTTA GACGGTCTTT ACAGCGTGTA
AGGATTTTTG ATGGAGGCAA TTCTAAAAAG ACTGCACTTG CTTTTCAAGA ACCTAAAACA
GAAGCAGGTA AAAGAATAGT GCCTTTGCCA CCAGTAATAA TTGAAGAGTT AAAACAGCAC
AGAAAAAAAC AGTTAGAAGA AAAACTGAAA GCTGGAGCGC TTTATGAAGA TAACGATTTA
GTATTTGCAA CAGAGCTTGG TACCCCAATT GACCCAAGAA ATTTTGAAAG GCTTTTTTAC
AGAATTAGAG AAAAAGCGGG ACTTGACAAG AGTGTCAATT TTCATGCATT AAGACACACA
TATGCAACAA GGCTTTTAGA AGCAAATGAA CATCCCAAAG TTGTTCAAGA GCTTTTAGGA
CACAAAGATA TTTCTACAAC CCTCAATATT TATTCTCATG TTATGCCTGA GATAAAGAAA
GCTGCTGCAA TGAAATTAAA CAGTTTATTT GAGAATATAA AAACAAAGGG TAACCACTCC
TAA
 
Protein sequence
MPTKTKKRGN NEGSIYKRKD GLWCGQITIG RDENGRQKRQ YFYGKTRQEV AEKIAKTLND 
LANGVYVDPA KTTLKDWLNT WLWEYKKQTL RPSTFKDYLC YIERHINPAI GHYKLKDLRP
EHLQALYNAK YQEGLSISTI KQIHTVLHSA LDQALKNGLV NRNVSEATTL PKGKPKREIR
ILSLEEQQRF IAALEGERLK TAFLVELASG LRIGELLALR WKDVNFKDGY IEVRRSLQRV
RIFDGGNSKK TALAFQEPKT EAGKRIVPLP PVIIEELKQH RKKQLEEKLK AGALYEDNDL
VFATELGTPI DPRNFERLFY RIREKAGLDK SVNFHALRHT YATRLLEANE HPKVVQELLG
HKDISTTLNI YSHVMPEIKK AAAMKLNSLF ENIKTKGNHS
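
Consistency check

The figures in this record can be cross-checked against one another: the start and end coordinates should give the gene length, and the gene length should give the protein length. The Python sketch below is one way to do that, not part of the source record. It assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon (the gene sequence above ends in TAA). All function names are hypothetical helpers introduced for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in 10-letter groups into one uppercase string."""
    return "".join(text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(gene_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates and lengths as listed under Gene Information.
    print(span_length(1122771, 1123973))   # 1203
    print(protein_length_from_cds(1203))   # 400

    # First printed line of the gene sequence: six groups of ten bases.
    first_line = "ATGCCTACCA AAACAAAAAA GAGAGGGAAT AATGAAGGCA GCATATACAA AAGAAAAGAT"
    print(len(clean_sequence(first_line)))  # 60
```

To compare against the listed GC content, pass the full gene sequence through clean_sequence and then gc_content, and round the result to a whole percentage.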