Gene Teth514_2046 details


Gene Information

Locus tag: Teth514_2046
Symbol:
ID: 5876249
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoanaerobacter sp. X514
Kingdom: Bacteria
Replicon accession: NC_010320
Strand:
Start bp: 2055837
End bp: 2058212
Gene Length: 2376 bp
Protein Length: 791 aa
Translation table: 11
GC content: 33%
IMG OID: 641542392
Product: recombination and DNA strand exchange inhibitor protein
Protein accession: YP_001663654
Protein GI: 167040669
COG category: [L] Replication, recombination and repair
COG ID: [COG1193] Mismatch repair ATPase (MutS family)
TIGRFAM ID: [TIGR01069] MutS2 family protein
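
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2055837
end_bp = 2058212
protein_length_aa = 791

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 2376
print(remainder)                # 0
print(expected_protein_length)  # 791
```

Under these assumptions the computed values agree with the Gene Length (2376 bp) and Protein Length (791 aa) fields.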


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGAGGG GAATTAATAG TAGAGCAATA AAAAGTCTTG AATTTGACAA GATAGTGGAG 
TTTATTGTTG GCTATTGTGA TTCGGATTTA GGCAAACAAA AAGCTTTAGA TATTGTCATA
AAAAAAGACA TAGAGGAAAT AGAGAGGGAA TTAGATTTAC TGAATGAGGC AATTAGCTTT
ATTTCCTCCT ATGGAGGTAT TTCTTTTGCT TTTGAGGACA TAAGGGATTA TATAAAAAAA
GCACAAATAG ATTCCGTGCT TTATAACCAA GAGCTTTTAA AGATAAAGAA ATTTCTTAAC
TTGGTAAGCC AAATAAAAGG CTATTTTAAA AATCTTCAAG AAAGTGATAG ATTTGTAAGG
TTAAAAGAAT ATGATAAAAA AGTCTTACCG ATAAAAAATT TGGAAAAAAG GATTGAAAAT
ATAATAATTT CAGAAGATGA AATAGCAGAT GATGCTTCTC CAATGCTTAA AGCTTTAAGA
AGGCAGAAAT TGAGCATAAA TGAAAAAATA AGGGCGACAT TGAATTCAAT AATTTCTACA
CGTCAAAAAG AATTGCAAGA GCCTATTATA ACTGTAAGAC AGGGAAGATA CGTTGTACCT
GTAAAACAAG AATATCGCAG CACTTTTAAA GGTATCGTCC ACGACCAATC CTCCAGTGGT
GCTACTCTTT TTATTGAACC TATGCAAGTG GTTGACTTAA ACAACGAATT GAGACAAGTG
GAATTAAAAG AGAAGCAGGA GATACAGAGA ATACTTTTTG AACTTTCTCA AGAGGTCAAA
AAGTATTCAC AAATTTTATT TAATGACATT GAAATAGTTT CAGAATTAGA TTTTATATTT
GCTAAGGCTA AATATTCTTT AAAGCTAAAA GCTGTAAGGC CAGAGCTAAA CACAATGGGA
TATATTAATT TAAAAAAAGC AAGGCATCCT CTCATAAATC AGGAAGTAGT TGTTCCTATA
GATATACATA TAGGAAAGCA ATTTAATACT TTAGTCATTA CTGGTCCTAA TACCGGTGGG
AAAACTGTAA CTTTAAAAAC AGTGGGACTT TTAACTTTAA TGGCAATGGC GGGGCTTAAT
ATTCCTGCGG AAGAAAAGTC ACAGGTTTCA ATATTTGAGG AAGTTTTTGT GGATATAGGA
GATGAGCAGA GTATTGAACA AAGCTTAAGC ACTTTTTCTT CTCATATGAC GAATATTGTA
AGTATACTCC AAAAAGTAAA CAAAAATTGC CTTGTTTTGT TAGATGAATT AGGGGCAGGT
ACAGATCCTA TAGAAGGTGC TGCTTTGGCT ATGAGTATTC TCGATACCTT GCATAAGATT
GGTGCGAAGA CAATAGCCAC TACTCATTAT AGTGAGCTAA AGCAGTATGC CTTAAAAATT
CCAGGAGTAG AAAATGCCAG TGTGGAATTT GATGTTGAAA CTTTAAAGCC TACTTATAAA
CTTATAATAA GCCTTCCTGG CAAAAGCAAT GCCTTTGAAA TATCTAAAAG ATTGGGACTT
CCTCAGCAAA TAATAGAAAA TGCGAGAAAG TATATTTCAG GAGAAGCTTT AAAATTTGAA
GACATAATTG CAGACGTCGA AAGTAAACGA AGGGAATTGG AAAAGGCAAA TCACGAAATA
GCTTTTTTAA AGAAAGATGT GGAAATTTTA AAAGAGGAAT TAGAAAAAGA AAAGAAAAAA
TTGCAAAGTG AAAGGGATAA AATATTAAAA GAGGCGAAAG AAAAGGCGAG GAAAATAATA
CAAGAAGCGA AATTTACTGC TGAAGAAATA ATCAAAAAGA TAAGAGAGGC AGAAGAAAGC
ACACAAAATA AAGACAGGAT AATACAAGAA GTAAGAGAAG AATTAAAGAA AAATTTAGAA
GAATTAGAAG AAGAAGTTTT AAAGCCTAAA GAGGCTCACT ACAGCAGAAT CCCTGATAAT
TTAAAAGAGG GACAGACAGT CTATATAGTA CCTTTGGACC AAAATGGGAT TGTACTTTCT
CTTCCTGACA AATCGGGAAA TGTAGAAGTG CAGGCAGGAA TTTTAAAGAT GACAGTTCAT
ATAAGTAATT TGAGGGTAGC AGAGGAGAAA GAAGAGGAGG AAGTAAAAAA AGGCTACAGC
AAATTTGTAC ACGAAAAGTC TCAATCTATA AGCACTTCCA TAGATGTAAG AGGTAAAAAC
CTTGACGATG CTTTATTAGA GGTGGAAAAA TATATAGACG ATGCTTATTT AGCTGGACTT
AAGGAGGTGA CAATTATTCA CGGACGTGGT ACAGGGGTGT TAAGGACAGG GATATCACAA
TTTTTAAGAA GCAATAAGCA TGTTAAATCT TTTAGATTAG GTAAATACGG TGAAGGAGGA
GACGGTGTTA CAATAGTAGA ATTAGCCAAT AAATAG
 
Protein sequence
MVRGINSRAI KSLEFDKIVE FIVGYCDSDL GKQKALDIVI KKDIEEIERE LDLLNEAISF 
ISSYGGISFA FEDIRDYIKK AQIDSVLYNQ ELLKIKKFLN LVSQIKGYFK NLQESDRFVR
LKEYDKKVLP IKNLEKRIEN IIISEDEIAD DASPMLKALR RQKLSINEKI RATLNSIIST
RQKELQEPII TVRQGRYVVP VKQEYRSTFK GIVHDQSSSG ATLFIEPMQV VDLNNELRQV
ELKEKQEIQR ILFELSQEVK KYSQILFNDI EIVSELDFIF AKAKYSLKLK AVRPELNTMG
YINLKKARHP LINQEVVVPI DIHIGKQFNT LVITGPNTGG KTVTLKTVGL LTLMAMAGLN
IPAEEKSQVS IFEEVFVDIG DEQSIEQSLS TFSSHMTNIV SILQKVNKNC LVLLDELGAG
TDPIEGAALA MSILDTLHKI GAKTIATTHY SELKQYALKI PGVENASVEF DVETLKPTYK
LIISLPGKSN AFEISKRLGL PQQIIENARK YISGEALKFE DIIADVESKR RELEKANHEI
AFLKKDVEIL KEELEKEKKK LQSERDKILK EAKEKARKII QEAKFTAEEI IKKIREAEES
TQNKDRIIQE VREELKKNLE ELEEEVLKPK EAHYSRIPDN LKEGQTVYIV PLDQNGIVLS
LPDKSGNVEV QAGILKMTVH ISNLRVAEEK EEEEVKKGYS KFVHEKSQSI STSIDVRGKN
LDDALLEVEK YIDDAYLAGL KEVTIIHGRG TGVLRTGISQ FLRSNKHVKS FRLGKYGEGG
DGVTIVELAN K
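
The gene and protein sequences above are related by translation table 11, and the GC content field is a simple base count over the gene sequence. The following is a minimal sketch of both calculations, not the database's own method. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. The codon table is built from the standard NCBI ordering of bases (T, C, A, G); table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as initiators. This gene starts with ATG, so no special handling of alternative start codons is attempted here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display blocks above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence above.
first_line = "ATGGTGAGGG GAATTAATAG TAGAGCAATA AAAAGTCTTG AATTTGACAA GATAGTGGAG"
last_line = "GACGGTGTTA CAATAGTAGA ATTAGCCAAT AAATAG"

print(translate(first_line))  # MVRGINSRAIKSLEFDKIVE
print(translate(last_line))   # DGVTIVELANK
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the protein sequence and the GC content field listed in the record.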