Gene Teth514_1053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_1053 
Symbol 
ID5876592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp1087901 
End bp1090684 
Gene Length2784 bp 
Protein Length927 aa 
Translation table11 
GC content37% 
IMG OID641541408 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001662688 
Protein GI167039703 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00826345 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAG ACAAAATTGT AATTAAAGGG GCAAGAGTTC ATAATTTAAA AAATGTAGAT 
TTGGAGATTC CACGAGATAA ATTAACTGTG ATAACGGGTT TATCAGGTTC TGGTAAGTCT
TCTCTTGCCT TTGATACCAT TTATGCTGAA GGTCAGAGAA GATATGTGGA GTCGTTGTCA
GCTTATGCAA GGCAATTTTT AGGACAAATG GACAAACCAG ACGTTGATTA TATAGAAGGT
CTTTCACCGG CAATATCTAT AGACCAAAAG ACCACAAATA AAAACCCTCG TTCTACGGTA
GGAACGATAA CTGAGATTTA CGACTATCTA AGACTTTTAT ATGCAAGGGC TGGGATTCCC
CATTGTCCTG TTTGTGGAAA AGAGATTAGC ATGCAAACTA TAGATCAGAT GGTAGACAGA
GTCAAAGAAT TGCCAGAAGG TACAAGGATA CAGGTACTGG CTCCTGTTAT TAGAGGAAGA
AAAGGGGAAT ATGCTAAGCT TTTAAATGAC ATAAAAAAGA GCGGATATGT CAGAGTGAAA
ATTGATGGAG TCATGTACGA TGTAAATGAG GAGATTAAAC TTGACAAAAA TAAAAAGCAT
ACTATTGAAG TTGTGGTAGA CAGGGTTATT ATAAAACCGG GAATAGACAT GAGGTTGACA
GACTCTATAG AAACAGCTTT AAAATTAGCA GATGGGATAG TTTCTATTGA TGTAATAGAC
GGAGAAAGCT TTACACTCTC TGAAAAATAC GCTTGTACAG AGTGCAACAT AAGTATTGAA
GAACTTTCCC CGAGAATGTT TTCCTTCAAT AGCCCTTATG GGGTTTGCCC TGTTTGCACT
GGATTGGGAG AATTTATGAA GGTAGATCCA GAGCTTTTAA TACAAGACCC TAAAAAATCA
TTAGCAAATG GGTTGTTGCC GGGGATTGTT GCTTCACAGG ATAGTTATGC TTATTACAAC
ATTTTAAGAT TAATTGAACA TTTTGGATAT ACAGAGAATA CTCCTTATGA AAAGTTTAGT
GAAGATTTAA AGAATGTACT GCTTTATGGC AAAGATACAA AAGGTAAGTC CTATGGATTT
GAAGGTATAG TAAACAATCT TGAAAGAAGG TACAACAATA CTTCTTCAGA TTTTATAAAA
GAAGAGATAG AGAAATATAT GAGACCGGTT ACTTGTCCTG CATGCCATGG GGCTAGATTA
AAGCCAGAAG CGCTAGCTGT GACTGTTGGT GGACTTTCCA TAAAAGAAAT GACAGACCTT
TCTGTTGGCG AGCTTATAAA ATTTATTGAG GAACTCAAAT TAACAGAAAA ACAAGAGATT
ATTGCAAAGC CAATTTTAAA AGAAATAAAG GCAAGGCTTA ATTTTCTTGT GGATGTAGGA
CTGGATTATT TGACTCTTTC AAGACCTGCA GCTACTTTAT CAGGTGGAGA AGCACAAAGG
ATAAGATTGG CAAGCCAGAT AGGCTCTGGG CTTGTAGGAG TCACATACAT CCTGGACGAG
CCGAGCATTG GACTTCATCA AAGAGATAAT GAAAGGCTTA TAAATTCCTT GAAAAAATTA
AGAGACCAAG GCAATACTCT TATAGTAGTA GAGCACGATG AGGATACAAT ATATGCGGCA
GACTACATTG TGGATGTAGG ACCAGGCGCA GGTGAGCATG GAGGAGAAAT TGTAATTGCG
GGTACGATAG AAGATGTGTT AAAATGTGAA AAATCAATTA CAGGCCAGTA TTTAAGTGGT
AAAATAAAGA TAGAAGTGCC AAAACAGAGG AGAAAACCTA ATGGAAAAGC TTTAATAGTG
AAAGGAGCTA AGGAAAACAA TTTAAAGAAT ATAGATGTGG TTTTCCCCCT CGGAGTATTT
ATATGTGTTA CAGGGGTTTC TGGCTCAGGC AAAAGCACCC TTATAAATGA GATACTGTAC
AAAGCATTGG CACAGAAGAT TTATAAGTCC AAAGATAAAC CAGGTATGCA CGATGCAATA
GAGGGTATCG ATAATATAGA TAAAGTAATA AATATTGACC AGTCTCCTAT AGGCAGGACT
CCTCGCTCAA ATCCTGCTAC CTATACAGGA GTTTTCGACT ATATAAGAGA GGTTTTTGCA
AATACTCCAG AAGCTAAAAT GAGAGGCTAT AAACCAGGAA GATTTAGTTT TAATGTTAAA
GGTGGAAGGT GTGAAGCCTG TGGTGGAGAT GGTATAATTA AAATTGAGAT GAACTTTTTG
CCGGATGTGT ATGTCCCTTG TGAAGTCTGC AAAGGGCAAA GGTATAATAG GGAAACATTG
GAAGTAAAAT ATAAAGGGAA AAATATTTCG GATGTACTTA ATATGACGGT AGAAGAGGCG
TTAGAATTTT TTGAAAATAT ACCCAGGATA AAAAATAAAT TGATGACTTT ATATGATGTG
GGTTTAGGAT ATATTAAGCT AGGGCAACCT TCTACTCAGC TTTCAGGAGG AGAAGCACAA
AGAGTAAAAT TAGCTACTGA ACTTTCTAAA AGGCCTACAG GTAAAACACT ATATATTTTG
GATGAGCCTA CAACTGGATT ACATTTTGCA GATGTGCATA GGCTTCTTGA AGTTTTAAAT
AGATTAACTG ATGCGGGCAA TACTGTTATT GTAATTGAAC ACAACTTAGA TATCATAAAA
AGTGCAGACT ATATCATTGA CTTAGGACCA GAAGGTGGAG ACAAGGGTGG AAGGGTAATA
GCTACAGGGA CACCTGAAGA AGTTGCTGCT AATGAGAATT CTTATACAGG CCATTTTTTG
AAAAAAGTCC TTTCTCAAAA ATGA
 
Protein sequence
MAKDKIVIKG ARVHNLKNVD LEIPRDKLTV ITGLSGSGKS SLAFDTIYAE GQRRYVESLS 
AYARQFLGQM DKPDVDYIEG LSPAISIDQK TTNKNPRSTV GTITEIYDYL RLLYARAGIP
HCPVCGKEIS MQTIDQMVDR VKELPEGTRI QVLAPVIRGR KGEYAKLLND IKKSGYVRVK
IDGVMYDVNE EIKLDKNKKH TIEVVVDRVI IKPGIDMRLT DSIETALKLA DGIVSIDVID
GESFTLSEKY ACTECNISIE ELSPRMFSFN SPYGVCPVCT GLGEFMKVDP ELLIQDPKKS
LANGLLPGIV ASQDSYAYYN ILRLIEHFGY TENTPYEKFS EDLKNVLLYG KDTKGKSYGF
EGIVNNLERR YNNTSSDFIK EEIEKYMRPV TCPACHGARL KPEALAVTVG GLSIKEMTDL
SVGELIKFIE ELKLTEKQEI IAKPILKEIK ARLNFLVDVG LDYLTLSRPA ATLSGGEAQR
IRLASQIGSG LVGVTYILDE PSIGLHQRDN ERLINSLKKL RDQGNTLIVV EHDEDTIYAA
DYIVDVGPGA GEHGGEIVIA GTIEDVLKCE KSITGQYLSG KIKIEVPKQR RKPNGKALIV
KGAKENNLKN IDVVFPLGVF ICVTGVSGSG KSTLINEILY KALAQKIYKS KDKPGMHDAI
EGIDNIDKVI NIDQSPIGRT PRSNPATYTG VFDYIREVFA NTPEAKMRGY KPGRFSFNVK
GGRCEACGGD GIIKIEMNFL PDVYVPCEVC KGQRYNRETL EVKYKGKNIS DVLNMTVEEA
LEFFENIPRI KNKLMTLYDV GLGYIKLGQP STQLSGGEAQ RVKLATELSK RPTGKTLYIL
DEPTTGLHFA DVHRLLEVLN RLTDAGNTVI VIEHNLDIIK SADYIIDLGP EGGDKGGRVI
ATGTPEEVAA NENSYTGHFL KKVLSQK