Gene Information |
Locus tag | Teth514_1218 |
Symbol | |
ID | 5877798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoanaerobacter sp. X514 |
Kingdom | Bacteria |
Replicon accession | NC_010320 |
Strand | - |
Start bp | 1258950 |
End bp | 1261922 |
Gene Length | 2973 bp |
Protein Length | 990 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641541568 |
Product | hypothetical protein |
Protein accession | YP_001662848 |
Protein GI | 167039863 |
COG category | [V] Defense mechanisms |
COG ID | [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases |
TIGRFAM ID | [TIGR00348] type I site-specific deoxyribonuclease, HsdR family |
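
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 1258950
END_BP = 1261922
GENE_LENGTH_BP = 2973
PROTEIN_LENGTH_AA = 990


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus the terminal stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 1261922 - 1258950 + 1 = 2973 and 2973 / 3 - 1 = 990.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Because the record lists the strand as "-", the start coordinate is the lower replicon position rather than the position of the start codon; the length arithmetic is the same on either strand.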
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCTA CTAATACACG AGAAAGTGGT CTTGAATCTT TGATTGTAGA TTGGCTTGTA AATCAAAATG GTTATGAACA AGGCAGCAAT GCTGACTATA ACCGTGACTA TGCTATTGAT GAAACACGCC TATTTCGTTT TCTTTCAGCG ACGCAGCCAG ATGAAATGGA GAAACTCGGT GTATTTAAAA GCGACTTAAA AAAGGCTCAG TTTCTAAACC GATTGCGTGG TGAAATAGCA AAACGCGGAA TTATTGATGT ACTTCGTAAT GGTATTAAGG TTTATCCTGC TGACCTGGTT ATGTTTTATC TAACACCAAG TGAGAGAAAC ATAAAAGCAA AAGCTCTATT TGAGCAGAAT ATTTTCAGTG TTACACGACA ACTGCAGTAT TCAAAAGATG CGACTCGTCT TGCCCTTGAT TTGTGCATTT TTATCAATGG CTTGCCAGTT ATAACATGCG AGCTTAAGAA TCAACTTACA AAGCAAAATG TTGATGATGC TGTTTACCAA TATAAAACGG ATCGTGATCC GAAGGAACTG CTTTTCCAAT TTAAACGCTG TATGGTTCAT TTTGCAGTAG ATGATGCAAG GGTCAAGTTC AGTACTAAGC TTGATGGTAA AGCTTCCTGG TTCTTGCCAT TTGACAAAGG GTACAATGAT GGAGCCGGCA ACCCTCCAAA TTCTTCTGGT ATAATGACAG ATTATTTATG GAAGGACATC CTTGAAAAGT ATATGCTTGC ACATATAATC GAAAATTACG CTCAAGTTGT TGAAAAAGTA GACCAGGAAA CAAAAAAGAA AACATATACA CAAATTTTCC CACGTTACCA TCAACTGTCT GCTGTTGAAA GTCTCCTCGC AGATGTACGA CATAATGGTG TTGGCCAAAG ATACTTAATT CAACATAGTG CTGGTAGTGG AAAATCAAAT TCTATTGCAT GGCTGGCTCA TCAACTCGTA GGACTCGAAA AGAATGGAAA AGCCATCATT GACTCTGTGG TAGTTGTTAC AGACCGTGTA ATACTTGATA AACAAATTCG AGATACGATA AAACAATTTA TGCAGGTTTC TAGCACTGTA GCATGGGCAG AACACTCTGA TGATTTAAGG AAAGCAATCA ATGGCGGTAA GAAGATTATA ATAACTACTG TACATAAGTT CCCTATTATT CTTGATAGTA TAGGTTCAGA ACACAAAGGG CGTTCTTTTG CCATAATTAT TGACGAGGCC CATTCATCAC AGAGCGGTAA CATGTCGGCT AAGATGAATA TTGTATTATC GGGTGAAGTT ACTGGAGAAG AGGAAGATTT TGAAGATAAA ATCAACCGCC TTATGGAAGG GCGCAAAATG CTGAAGAACG CTAGCTATTT TGCATTTACC GCTACTCCGA AAAACAAAAC CCTTGAAATG TTTGGTATCC CATACCAAGA CGGAGATGAA ATTAAACATC GTCCGTTCCA TGTATATACA ATGAAGCAAG CAATTCAAGA AGGTTTTATT TTAGATGTAC TTAAATACTA TACCCCTGTT GACAGTTATT ACAGACTTGC TAAAACCATT GAAGATGATC CTTTATTTGA CAAGAAAAAA GCACAAAAGA AACTCCGTCA ATTTGTAGAA AGCAATAAAT TTGCCATATC ACAAAAAGCA GAAATCATGG TGAACCACTT TCATGATCAG GTTATTTCAA AAGGAAAAAT CGGAGGAAAA GCGCGAGCTA TGGTGGTTAC GAGTAGTATA GAGCGCTGTA TAGAATACTA TTACGCAATT AATAAATGCC TTGCTGACAG GCGTAGTCCT 
TATAAAGCTA TTATTGCTTT TTCTGGGGAA AAAGAATATG GTGGTAAAAC TTTAACCTCT GCAGCAATTA ATGGCTTCCC AGATAATACA ATTGAGAAGG TATTCCGTAA GGATCCATAT CGATTTCTTA TAGTGGCTGA TATGTTCCAA ACGGGTTATG ATGAACCACT ACTCCATACA ATGTACGTTG ACAAAATGCT ATCTGATATA AAGGCGGTTC AGACTCTATC TCGACTGAAC CGCTCTCATC CACAGAAACA TGATACTTTT GTACTTGATT TTGCAAATAA AACAGAGACC ATTGAAGCAG CATTCTCAAA ATATTATCGG ACAACTATTC TGTCTAATGA AACTGATCCG AACAAGCTTT ATGATCTCAT AGCAATTATG GAATCCCATC AAGTATACGA AAGTGGGCAT GTTGATTCGC TTGTTGAACT GTACTTAAAT GGTGCAGAAC GTGATAGGTT AGATCCTATC CTCGATGCAT GTACTGCTAT TTACAAAGAG CTTGATGACG AAGGGAAAAT TGAATTCAAA AGTGCAGCAA AAGCATTTGT TCGAACATAC GGCTTTCTTG GAGCTATTCT TCCTTACGGT AATGCAGAAT GGGAAAAGCT ATCAATATTT TTAAATTTAT TAATACCTAA ACTTCCTTCT CCCAAAGAAG ATGATTTATC TCAAGGGATA CTAGATTCAA TTGATTTAGA TAGTTACCGA GTGGAAGCTC GTGATTCTAT GTCTCTTGTA TTAGATGATG CTGACGCTGA GATTGGCCCT GTACCTGCTG GTCGTGTAGG TGGCATAGTG GAGCCAGAAA TGGATTTACT TTCTAGCATT CTTTCATCAT TTAATGACTT GTTCGGCAAT ATAGACTGGA ACGATGCTGA TAATGTTCGC CGCCAAATTC TAGAAATACC AGGAATGGTT ACAAAAGACG AGCGCTATAT TAACGCAATG AAAAATTCAG ACAAGCAAAA TGCGCGTATG GAAAGTGAAC GTGCCCTTCA GTCGGTTATA TTTAGTATAA TGGCGGATAA TATGGAGTTA TTTAAGCAGT TTAATGATAA TCCTTCGTTT AAGAAATGGC TGTCGGATCT TGTTTTCAAT TTAACGTATA ACCCTGAAGG AAAGCCATTT GAAACTCCTT CCAATGATTC AAATAACAAA TAA
|
Protein sequence | MTATNTRESG LESLIVDWLV NQNGYEQGSN ADYNRDYAID ETRLFRFLSA TQPDEMEKLG VFKSDLKKAQ FLNRLRGEIA KRGIIDVLRN GIKVYPADLV MFYLTPSERN IKAKALFEQN IFSVTRQLQY SKDATRLALD LCIFINGLPV ITCELKNQLT KQNVDDAVYQ YKTDRDPKEL LFQFKRCMVH FAVDDARVKF STKLDGKASW FLPFDKGYND GAGNPPNSSG IMTDYLWKDI LEKYMLAHII ENYAQVVEKV DQETKKKTYT QIFPRYHQLS AVESLLADVR HNGVGQRYLI QHSAGSGKSN SIAWLAHQLV GLEKNGKAII DSVVVVTDRV ILDKQIRDTI KQFMQVSSTV AWAEHSDDLR KAINGGKKII ITTVHKFPII LDSIGSEHKG RSFAIIIDEA HSSQSGNMSA KMNIVLSGEV TGEEEDFEDK INRLMEGRKM LKNASYFAFT ATPKNKTLEM FGIPYQDGDE IKHRPFHVYT MKQAIQEGFI LDVLKYYTPV DSYYRLAKTI EDDPLFDKKK AQKKLRQFVE SNKFAISQKA EIMVNHFHDQ VISKGKIGGK ARAMVVTSSI ERCIEYYYAI NKCLADRRSP YKAIIAFSGE KEYGGKTLTS AAINGFPDNT IEKVFRKDPY RFLIVADMFQ TGYDEPLLHT MYVDKMLSDI KAVQTLSRLN RSHPQKHDTF VLDFANKTET IEAAFSKYYR TTILSNETDP NKLYDLIAIM ESHQVYESGH VDSLVELYLN GAERDRLDPI LDACTAIYKE LDDEGKIEFK SAAKAFVRTY GFLGAILPYG NAEWEKLSIF LNLLIPKLPS PKEDDLSQGI LDSIDLDSYR VEARDSMSLV LDDADAEIGP VPAGRVGGIV EPEMDLLSSI LSSFNDLFGN IDWNDADNVR RQILEIPGMV TKDERYINAM KNSDKQNARM ESERALQSVI FSIMADNMEL FKQFNDNPSF KKWLSDLVFN LTYNPEGKPF ETPSNDSNNK
|
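
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of NCBI translation table 11 (the table named in the record), written in the usual TCAG codon order; it does not model alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, ignoring display whitespace and stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in 10-base blocks.
print(translate("ATGACTGCTA CTAATACACG AGAAAGTGGT"))  # MTATNTRESG
```

The output matches the first ten residues of the protein sequence listed in the record. Applying the same function to the full gene sequence should reproduce the whole protein, provided the sequence contains only the four unambiguous bases.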
| |
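
The GC content field can be recomputed from the gene sequence once the display spacing is removed. The sketch below shows the calculation on the first two 10-base blocks only; the helper names are hypothetical, and the result for this short prefix is not expected to equal the 38% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


prefix = clean_sequence("ATGACTGCTA CTAATACACG")
print(len(prefix), gc_fraction(prefix))  # 20 0.4
```

Running `gc_fraction` on the complete cleaned gene sequence gives the value to compare with the GC content row.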