Gene Teth514_1218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTeth514_1218 
Symbol 
ID5877798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoanaerobacter sp. X514 
KingdomBacteria 
Replicon accessionNC_010320 
Strand
Start bp1258950 
End bp1261922 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content38% 
IMG OID641541568 
Producthypothetical protein 
Protein accessionYP_001662848 
Protein GI167039863 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCTA CTAATACACG AGAAAGTGGT CTTGAATCTT TGATTGTAGA TTGGCTTGTA 
AATCAAAATG GTTATGAACA AGGCAGCAAT GCTGACTATA ACCGTGACTA TGCTATTGAT
GAAACACGCC TATTTCGTTT TCTTTCAGCG ACGCAGCCAG ATGAAATGGA GAAACTCGGT
GTATTTAAAA GCGACTTAAA AAAGGCTCAG TTTCTAAACC GATTGCGTGG TGAAATAGCA
AAACGCGGAA TTATTGATGT ACTTCGTAAT GGTATTAAGG TTTATCCTGC TGACCTGGTT
ATGTTTTATC TAACACCAAG TGAGAGAAAC ATAAAAGCAA AAGCTCTATT TGAGCAGAAT
ATTTTCAGTG TTACACGACA ACTGCAGTAT TCAAAAGATG CGACTCGTCT TGCCCTTGAT
TTGTGCATTT TTATCAATGG CTTGCCAGTT ATAACATGCG AGCTTAAGAA TCAACTTACA
AAGCAAAATG TTGATGATGC TGTTTACCAA TATAAAACGG ATCGTGATCC GAAGGAACTG
CTTTTCCAAT TTAAACGCTG TATGGTTCAT TTTGCAGTAG ATGATGCAAG GGTCAAGTTC
AGTACTAAGC TTGATGGTAA AGCTTCCTGG TTCTTGCCAT TTGACAAAGG GTACAATGAT
GGAGCCGGCA ACCCTCCAAA TTCTTCTGGT ATAATGACAG ATTATTTATG GAAGGACATC
CTTGAAAAGT ATATGCTTGC ACATATAATC GAAAATTACG CTCAAGTTGT TGAAAAAGTA
GACCAGGAAA CAAAAAAGAA AACATATACA CAAATTTTCC CACGTTACCA TCAACTGTCT
GCTGTTGAAA GTCTCCTCGC AGATGTACGA CATAATGGTG TTGGCCAAAG ATACTTAATT
CAACATAGTG CTGGTAGTGG AAAATCAAAT TCTATTGCAT GGCTGGCTCA TCAACTCGTA
GGACTCGAAA AGAATGGAAA AGCCATCATT GACTCTGTGG TAGTTGTTAC AGACCGTGTA
ATACTTGATA AACAAATTCG AGATACGATA AAACAATTTA TGCAGGTTTC TAGCACTGTA
GCATGGGCAG AACACTCTGA TGATTTAAGG AAAGCAATCA ATGGCGGTAA GAAGATTATA
ATAACTACTG TACATAAGTT CCCTATTATT CTTGATAGTA TAGGTTCAGA ACACAAAGGG
CGTTCTTTTG CCATAATTAT TGACGAGGCC CATTCATCAC AGAGCGGTAA CATGTCGGCT
AAGATGAATA TTGTATTATC GGGTGAAGTT ACTGGAGAAG AGGAAGATTT TGAAGATAAA
ATCAACCGCC TTATGGAAGG GCGCAAAATG CTGAAGAACG CTAGCTATTT TGCATTTACC
GCTACTCCGA AAAACAAAAC CCTTGAAATG TTTGGTATCC CATACCAAGA CGGAGATGAA
ATTAAACATC GTCCGTTCCA TGTATATACA ATGAAGCAAG CAATTCAAGA AGGTTTTATT
TTAGATGTAC TTAAATACTA TACCCCTGTT GACAGTTATT ACAGACTTGC TAAAACCATT
GAAGATGATC CTTTATTTGA CAAGAAAAAA GCACAAAAGA AACTCCGTCA ATTTGTAGAA
AGCAATAAAT TTGCCATATC ACAAAAAGCA GAAATCATGG TGAACCACTT TCATGATCAG
GTTATTTCAA AAGGAAAAAT CGGAGGAAAA GCGCGAGCTA TGGTGGTTAC GAGTAGTATA
GAGCGCTGTA TAGAATACTA TTACGCAATT AATAAATGCC TTGCTGACAG GCGTAGTCCT
TATAAAGCTA TTATTGCTTT TTCTGGGGAA AAAGAATATG GTGGTAAAAC TTTAACCTCT
GCAGCAATTA ATGGCTTCCC AGATAATACA ATTGAGAAGG TATTCCGTAA GGATCCATAT
CGATTTCTTA TAGTGGCTGA TATGTTCCAA ACGGGTTATG ATGAACCACT ACTCCATACA
ATGTACGTTG ACAAAATGCT ATCTGATATA AAGGCGGTTC AGACTCTATC TCGACTGAAC
CGCTCTCATC CACAGAAACA TGATACTTTT GTACTTGATT TTGCAAATAA AACAGAGACC
ATTGAAGCAG CATTCTCAAA ATATTATCGG ACAACTATTC TGTCTAATGA AACTGATCCG
AACAAGCTTT ATGATCTCAT AGCAATTATG GAATCCCATC AAGTATACGA AAGTGGGCAT
GTTGATTCGC TTGTTGAACT GTACTTAAAT GGTGCAGAAC GTGATAGGTT AGATCCTATC
CTCGATGCAT GTACTGCTAT TTACAAAGAG CTTGATGACG AAGGGAAAAT TGAATTCAAA
AGTGCAGCAA AAGCATTTGT TCGAACATAC GGCTTTCTTG GAGCTATTCT TCCTTACGGT
AATGCAGAAT GGGAAAAGCT ATCAATATTT TTAAATTTAT TAATACCTAA ACTTCCTTCT
CCCAAAGAAG ATGATTTATC TCAAGGGATA CTAGATTCAA TTGATTTAGA TAGTTACCGA
GTGGAAGCTC GTGATTCTAT GTCTCTTGTA TTAGATGATG CTGACGCTGA GATTGGCCCT
GTACCTGCTG GTCGTGTAGG TGGCATAGTG GAGCCAGAAA TGGATTTACT TTCTAGCATT
CTTTCATCAT TTAATGACTT GTTCGGCAAT ATAGACTGGA ACGATGCTGA TAATGTTCGC
CGCCAAATTC TAGAAATACC AGGAATGGTT ACAAAAGACG AGCGCTATAT TAACGCAATG
AAAAATTCAG ACAAGCAAAA TGCGCGTATG GAAAGTGAAC GTGCCCTTCA GTCGGTTATA
TTTAGTATAA TGGCGGATAA TATGGAGTTA TTTAAGCAGT TTAATGATAA TCCTTCGTTT
AAGAAATGGC TGTCGGATCT TGTTTTCAAT TTAACGTATA ACCCTGAAGG AAAGCCATTT
GAAACTCCTT CCAATGATTC AAATAACAAA TAA
 
Protein sequence
MTATNTRESG LESLIVDWLV NQNGYEQGSN ADYNRDYAID ETRLFRFLSA TQPDEMEKLG 
VFKSDLKKAQ FLNRLRGEIA KRGIIDVLRN GIKVYPADLV MFYLTPSERN IKAKALFEQN
IFSVTRQLQY SKDATRLALD LCIFINGLPV ITCELKNQLT KQNVDDAVYQ YKTDRDPKEL
LFQFKRCMVH FAVDDARVKF STKLDGKASW FLPFDKGYND GAGNPPNSSG IMTDYLWKDI
LEKYMLAHII ENYAQVVEKV DQETKKKTYT QIFPRYHQLS AVESLLADVR HNGVGQRYLI
QHSAGSGKSN SIAWLAHQLV GLEKNGKAII DSVVVVTDRV ILDKQIRDTI KQFMQVSSTV
AWAEHSDDLR KAINGGKKII ITTVHKFPII LDSIGSEHKG RSFAIIIDEA HSSQSGNMSA
KMNIVLSGEV TGEEEDFEDK INRLMEGRKM LKNASYFAFT ATPKNKTLEM FGIPYQDGDE
IKHRPFHVYT MKQAIQEGFI LDVLKYYTPV DSYYRLAKTI EDDPLFDKKK AQKKLRQFVE
SNKFAISQKA EIMVNHFHDQ VISKGKIGGK ARAMVVTSSI ERCIEYYYAI NKCLADRRSP
YKAIIAFSGE KEYGGKTLTS AAINGFPDNT IEKVFRKDPY RFLIVADMFQ TGYDEPLLHT
MYVDKMLSDI KAVQTLSRLN RSHPQKHDTF VLDFANKTET IEAAFSKYYR TTILSNETDP
NKLYDLIAIM ESHQVYESGH VDSLVELYLN GAERDRLDPI LDACTAIYKE LDDEGKIEFK
SAAKAFVRTY GFLGAILPYG NAEWEKLSIF LNLLIPKLPS PKEDDLSQGI LDSIDLDSYR
VEARDSMSLV LDDADAEIGP VPAGRVGGIV EPEMDLLSSI LSSFNDLFGN IDWNDADNVR
RQILEIPGMV TKDERYINAM KNSDKQNARM ESERALQSVI FSIMADNMEL FKQFNDNPSF
KKWLSDLVFN LTYNPEGKPF ETPSNDSNNK