Gene Athe_2506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2506 
Symbol 
ID7409375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2633071 
End bp2634354 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content42% 
IMG OID643716869 
Producttransposase 
Protein accessionYP_002574347 
Protein GI222530465 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01765] transposase, putative, N-terminal domain
[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000109626 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTAA CGTTATCCTG TAAATTTAAA CTCTGGCTGT CACAAGAACA AAAAAGAAAA 
CTCATCGAAA CTGCAAAAAC ATATACCAGT GCTATTAACT TTGTCTTAGC CGAAAATCTG
AAAGACAAAA CAACTAACGT AAAGAAACTA CACAAACTCT ACTACAAAAC CATCAGAAAA
AAGTTTTCTC TTCCCTCCCA GCTGGCTATC AATGTCTACC GACAAGTTGC AGACATATAC
CAAACCCTGT GGGCACAATA TAACAACCTG CTGTACAGAG AGCAAAATAG CAACAACAAT
AGTGGTACCG CTGAAGAATT CTGGAGTAAA CCACCAAAAC GAAAAACTCT CACAGTAAAC
TACACGCACG GTCGCACCTT CTCCATCAAG TACGACAAAA ATACAGACAC CTTCTTCGTA
TCCATTTCCT CCATCCATGG CAGAATTAAA AACGTTCGAA TCACCGGTTG GAAACAACAC
TACAACTATT TAAAACACGG CGAAATAGGT GACCCTGTGC TTGTCTATGA TAAACCATCA
AAAGAATTTT ACCTGCACAT CCCAGTAACC CTGGAAATCG ACGAAAAACT GCACAAAGAA
ATTGCTGGTA TAGACGTTGG AGAGAGAAAT ATTGTAACAG TAGTGTCAAC TGCTGGTGCG
AGATATACTA TACCACTTCC TGACCAGGTT AGACGTACCA AGCGTCACTA TCACGAGTTG
CGCTCTCAGT TGATGTCAAA AGGCACTCGC TCTGCCAGAA GAAAACTCCA AAAGATTGGC
ATGAGCGAGA AACGGTTCGT GTCCAACTTT CTACATAAAC TCACTAAGGA CCTTGTCAGG
AAGCACCCGG CAGCACTATT TGTCATGGAA GATTTGAGCA TGATCAGAAC AAACAGGATA
ACGTATCGTG GCAATGATAG TGAAGCGCGC CGCCAAGCAG AACAATGGCC TTTTGCCGAA
CTACAAAACA AATTGGAGTA CAAATCAATA CTCTACAATG GAATATGTTC AGTCAAAGTT
GACCCTTCGT ATACTTCGCT ATCCTGTCCT GTTTGTGGAC ATGTATCGAA AGACAACCGC
CCCGGACATG GTGAACTATT TAAGTGTCAG CGCTGCGGTT ATGAAGAAAA TGCTGACATA
GTAGGCGCAA CGAATATAGC AATAAGGTAT CTTGTGGAAG TTCAGCAGAT GAACCTGAGA
GGGCTGCTTG TCAACCAGCC TAATGTTCCC TGTTTGCAAA AACAGGTAGA GCAAGCTCCT
ACCTCTATAG GTAGGAGCAG TTGA
 
Protein sequence
MKLTLSCKFK LWLSQEQKRK LIETAKTYTS AINFVLAENL KDKTTNVKKL HKLYYKTIRK 
KFSLPSQLAI NVYRQVADIY QTLWAQYNNL LYREQNSNNN SGTAEEFWSK PPKRKTLTVN
YTHGRTFSIK YDKNTDTFFV SISSIHGRIK NVRITGWKQH YNYLKHGEIG DPVLVYDKPS
KEFYLHIPVT LEIDEKLHKE IAGIDVGERN IVTVVSTAGA RYTIPLPDQV RRTKRHYHEL
RSQLMSKGTR SARRKLQKIG MSEKRFVSNF LHKLTKDLVR KHPAALFVME DLSMIRTNRI
TYRGNDSEAR RQAEQWPFAE LQNKLEYKSI LYNGICSVKV DPSYTSLSCP VCGHVSKDNR
PGHGELFKCQ RCGYEENADI VGATNIAIRY LVEVQQMNLR GLLVNQPNVP CLQKQVEQAP
TSIGRSS