Gene Teth514_0197 details


Gene Information

Locus tag: Teth514_0197
Symbol:
ID: 5877392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 198013
End bp: 199233
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 39%
IMG OID: 641540539
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_001661851
Protein GI: 167038866
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
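
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 1221 bp) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 198013
end_bp = 199233


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1221 406
```

Both results match the Gene Length and Protein Length fields of the record.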


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTAG TTTACTCTCA CGTTTGCGGA TTAGATGTCC ATAAAAAGAA TGTCGTAGCT 
TGTATAATAA CACCAGAAGG TAAAGAAATC CGCACTTTTT CAACTATGAC CGATGACCTT
ATTGCATTAA AAGAATTTAT TAAAGCTAAA GGTTGTTCTG TTGTTGCTAT GGAAAGTACC
GGCTCTTATT GGAAACCTAT TTACAATCTA CTTGAGCTTG AGAGCATTAA AATCCTACTC
GTCAATGCTA AGCATATTAA AAATGTCCCT GGTAGAAAAA CCGATGTAAA AGATGCTGAG
TGGATAGCAA GTCTCTTACA ACATGGCCTT TTGCAAGGCA GCTTTGTGCC AGATCGTGAA
CAAAGAGAAC TTCGCGAGCT TGTACGCTAT AGAAAAAGCC TCATTGAAGA AAAATCAAGA
GAACTTAATC GCATACAAAA GGTTTTAGAA GGAGCTAATA TCAAACTGTC TTCGGTAGTC
TCTGATATCA ACGGGGCATC CAGTCGTTCT ATACTTGAGG CTATTATAAA TGGTGAAGAA
AATCCCGAAA CCCTGGCTGA GCTTTCTCAA GGCAAGCTAA AAAATAAAAT GGATGAACTA
AAACGCGCTT TAAAAGGCTT GATCAATCAT CACCAAAGGA TGCTTCTGGA AATACAGCTT
AGACATATTG ATTACCTTGA TGAAGAAATA GCAAAATTAG ACGAAGAAAT TAAAAATCGA
ATGCTCCCTT TTGAAAAAGA CCTGGCACTG CTGGATACAA TCCCTGGAGT CGGAAGAAGA
ACTGCAGAAC AAATAATAGC CGAAATCGGC ACGAATATGG AACAGTTCCC CTCTGCTGCC
CATTTGTGTT CTTGGGCAGG GTTGTGTCCA GGTCATAATG AAAGTGCTGG TAAACAAAAG
TCTGCAAGAA CTCGAAAAGG TAACCAAAAA TTGCGAAGCT CTCTTATTGA AGCTGCCAGA
GCTGCCTCAA GGGCAAAAGA TACTTATCTC TCAAGTCAGT ACCACCGCAT CGCTGCTCGA
AGAGGAGCAA ACCGTGCAGC AGTTGCAGTG GCACATAGCA TTTTAATTAT AGTTTATCAT
ATTCTCAAGC AAAAGCAACC ATATATTGAA TTAGGTCCTA CTTATTATGA AGAGAAAAAG
CGTAATATGA TTATTCGTCA ATCTTTAAAA AAGCTAGAGT CTTTAGGCCT TAAGGTCACG
GTCGAATCTG CAGTGTCTTA A
 
Protein sequence
MDLVYSHVCG LDVHKKNVVA CIITPEGKEI RTFSTMTDDL IALKEFIKAK GCSVVAMEST 
GSYWKPIYNL LELESIKILL VNAKHIKNVP GRKTDVKDAE WIASLLQHGL LQGSFVPDRE
QRELRELVRY RKSLIEEKSR ELNRIQKVLE GANIKLSSVV SDINGASSRS ILEAIINGEE
NPETLAELSQ GKLKNKMDEL KRALKGLINH HQRMLLEIQL RHIDYLDEEI AKLDEEIKNR
MLPFEKDLAL LDTIPGVGRR TAEQIIAEIG TNMEQFPSAA HLCSWAGLCP GHNESAGKQK
SARTRKGNQK LRSSLIEAAR AASRAKDTYL SSQYHRIAAR RGANRAAVAV AHSILIIVYH
ILKQKQPYIE LGPTYYEEKK RNMIIRQSLK KLESLGLKVT VESAVS
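
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a minimal sketch rather than the database's own method: it builds the codon table from the standard amino acid assignments, which translation table 11 shares for elongation codons, and it ignores the alternative start codons that table 11 allows (this gene begins with ATG, so they do not matter here). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 60 nucleotides of the gene are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
# Amino acid assignments are those of the standard code, which
# translation table 11 shares; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used above) is ignored.
    Alternative start codons are NOT handled specially.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record above.
first_60_nt = (
    "ATGGATTTAG TTTACTCTCA CGTTTGCGGA TTAGATGTCC ATAAAAAGAA TGTCGTAGCT"
)
print(translate(first_60_nt))  # prints: MDLVYSHVCGLDVHKKNVVA
```

The output agrees with the first 20 residues of the protein sequence listed above.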