Gene Athe_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2540 
Symbol 
ID7409410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2658064 
End bp2659185 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content39% 
IMG OID643716904 
Productintegrase family protein 
Protein accessionYP_002574381 
Protein GI222530499 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000519986 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGGAC ATATTGCAAA AAAAGGAAAG AAGTATTATA TTGTTGTGGA TATTGGTGTA 
GTTGACGGAA AGCGCAAACA AAAGTGGCTG GGTGGTTTTG AAACAAAAGC CGAAGCAGAA
GAAGCTTTAC CAAGGATTCT AAACATCATC TACCAGAACA AAAATAAGCT ACTGGAGCAG
AAGACTTTTG GTGAACTTTT AGACTTATGG CTCAACACGG TAGCAAAAAA CAGGGTTTCT
CTGAGTACCT TTCAATCATA CTCTAGTTCA ATTGCACTGC ACATAAAACC TTTTCTGGGT
GATGTAGAGC TGAACAAACT GACAGCAATG GATTTACAAA AGTACTATCA GCACATCACA
GGTGAGAAGG GTCTTTCATC AACACTTGCG TTATACCACC ACCGTATAAT TCATCAGGCT
CTTGATTATG CAGTGAGAAT GAACATGATT GAAAAAAATC CAGCTGATTA TGTAGACCCA
CCGAGAAAGC GAAGTTATCA GCCTTCAATC TGGGATGAGG ATACAATTAA AAGAGCACTT
GATATTTTTC GAAACACTAA TATCTGGGTA CCTGTGCTTT TAGCAATTTA CACAGGAATG
CGGATTGGTG AGATTACTGC ACTGAAGTGG GATGACATAA ATTTAGAGCA GGGCTATATA
GTTGTTAGCA AGACACTTAT GAAGATAAAC AGGCAGTTGT ACATCAAAGA AAAGGGAGAA
AACAAAAGTA GCTACAGGAT AGTAGCAATT TCAAGCAATG TAATAGAGGA ACTCAAGCAG
TGGAGGAGAA AACAAGAGCA GTACAAAGAA AAGCTAAAGG AACTTTATCA TAATGAAGGT
TTTGTGTGCA CATTTGAAGA TGGAAGAGTA CAGGACCCAA AGTATATTAC TAAAGTATTC
CAGAAGACAA TAGAAAAGCA TGGACTGCCA AAGATACGAT TTCACGACCT CAGGCACACC
CATGCTTCAC TTCTTCTGAA ACATGGTGTA GACCCCAAGC ACATCAGTGA CAGACTGGGG
CACAGTACCA TAACCACCAC ACTCAACATT TACAGTCATG TATTTGATGA AAAGCGGCGC
AAAGTGGCAG AAACTTTTGA GGACATTTTA AAAAACCAAT AG
 
Protein sequence
MRGHIAKKGK KYYIVVDIGV VDGKRKQKWL GGFETKAEAE EALPRILNII YQNKNKLLEQ 
KTFGELLDLW LNTVAKNRVS LSTFQSYSSS IALHIKPFLG DVELNKLTAM DLQKYYQHIT
GEKGLSSTLA LYHHRIIHQA LDYAVRMNMI EKNPADYVDP PRKRSYQPSI WDEDTIKRAL
DIFRNTNIWV PVLLAIYTGM RIGEITALKW DDINLEQGYI VVSKTLMKIN RQLYIKEKGE
NKSSYRIVAI SSNVIEELKQ WRRKQEQYKE KLKELYHNEG FVCTFEDGRV QDPKYITKVF
QKTIEKHGLP KIRFHDLRHT HASLLLKHGV DPKHISDRLG HSTITTTLNI YSHVFDEKRR
KVAETFEDIL KNQ