Gene Teth514_1903 details


Gene Information

Locus tag: Teth514_1903
Symbol:
ID: 5877409
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 1916686
End bp: 1917906
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 39%
IMG OID: 641542255
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_001663519
Protein GI: 167040534
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
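
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1916686
end_bp = 1917906

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1221
print(protein_length)  # 406
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.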


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTAG TTTATTCTCA CGTTTGCGGA TTAGATGTCC ATAAAAAGAA TGTCGTAGCT 
TGTATAATAA CACCAGAAGG TAAAGAAATC CGCACTTTTT CAACTATGAC CGATGACCTT
ATTGCATTAA AAGAATTTAT TAAAGCTAAA GGTTGTTCTG TTGTTGCTAT GGAAAGTACC
GGCTCTTACT GGAAACCTAT TTACAATCTC CTTGAGCTTG AGAATATTAA AATCCTACTC
GTCAATGCTA AGCATATTAA AAATGTCCCT GGTAGAAAAA CTGATGTAAA AGATGCCGAG
TGGATAGCAA GTCTCTTGCA ACATGGCCTT TTGCAAGGCA GCTTTGTCCC AGACAGAGAA
CAAAGAGAGC TTCGCGAACT TGTCCGCTAT AGAAAAAGCC TCATTGAAGA AAAATCAAGA
GAGCTTAACC GCATACAAAA GGTCTTAGAA GGTGCTAATA TTAAACTGTC TTCGGTAGTC
TCTGATATTA ATGGAGCTTC TAGTCGCTCT ATTCTTGAGG CTATCATAAA TGGTGAAGAA
AATCCTGAAA CCTTAGCACA GCTTTCCCAA GGTAAATTGA AAAATAAAAT GGATGAACTA
AAACGCTCTT TAAAAGGCCT AATTAATCAT CACCAAAAAA CTCTCATTGA AATTCAACTC
AGGCATATTG ATTATCTTGA CCAAGAAATA ACTAAATTAG ATGAAGAAAT TAAAAATAGA
ATGCACCCTT TTGAACAAGA CCTGGCACTG CTGGATACTA TCCCTGGTGT CGGAAGAAGA
ACTGCAGAAC AAATAATAGC CGAAATCGGT ACAAATATGG AACAGTTCCC CTCTGCTGCC
CATTTGTGTT CCTGGGCAGG GCTGTGTCCA GGTCATAACG AAAGTGCTGG TAAACAAAAG
TCTGCCAGAA CTCGAAAAGG TAACCAAAAA TTGCGAAGCT CTCTTATTGA AGCTGCCAGG
GCTGCCTCAA GGGCAAAAGA TACTTATCTC TCAAGTCAGT ACCACCGCAT CGCTGCTCGA
AGAGGAGCAA ACCGAGCAGC AGTTGCAGTG GCACATAGCA TTTTAGTTAT AGTTTATCAT
ATTCTCAAGC AAAAGCAACC ATATATTGAA TTAGGTCCTA CTTATTATGA AGAGAAAAAG
CGTAATATGA TTATTCGTCA ATCTTTAAAA AAGCTAGAGT CTTTAGGTCT TAAGGTCACG
GTCGAATCTG TAGCGTCTTA A
 
Protein sequence
MDLVYSHVCG LDVHKKNVVA CIITPEGKEI RTFSTMTDDL IALKEFIKAK GCSVVAMEST 
GSYWKPIYNL LELENIKILL VNAKHIKNVP GRKTDVKDAE WIASLLQHGL LQGSFVPDRE
QRELRELVRY RKSLIEEKSR ELNRIQKVLE GANIKLSSVV SDINGASSRS ILEAIINGEE
NPETLAQLSQ GKLKNKMDEL KRSLKGLINH HQKTLIEIQL RHIDYLDQEI TKLDEEIKNR
MHPFEQDLAL LDTIPGVGRR TAEQIIAEIG TNMEQFPSAA HLCSWAGLCP GHNESAGKQK
SARTRKGNQK LRSSLIEAAR AASRAKDTYL SSQYHRIAAR RGANRAAVAV AHSILVIVYH
ILKQKQPYIE LGPTYYEEKK RNMIIRQSLK KLESLGLKVT VESVAS
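
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few blocks by hand. The sketch below is not part of the original record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted), and the helper `translate` is a hypothetical name introduced here for illustration. Only the first and last blocks of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# Assumption: these amino-acid assignments apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above; both start in frame,
# because the 1200 bases before the last line are a multiple of three.
first_block = "ATGGATTTAG TTTATTCTCA CGTTTGCGGA TTAGATGTCC ATAAAAAGAA TGTCGTAGCT"
last_block = "GTCGAATCTG TAGCGTCTTA A"

print(translate(first_block))  # MDLVYSHVCGLDVHKKNVVA
print(translate(last_block))   # VESVAS
```

The two results match the first 20 and last 6 residues of the protein sequence listed above.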