Gene Tmel_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmel_0402 
Symbol 
ID5297837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermosipho melanesiensis BI429 
KingdomBacteria 
Replicon accessionNC_009616 
Strand
Start bp419495 
End bp422662 
Gene Length3168 bp 
Protein Length1055 aa 
Translation table11 
GC content38% 
IMG OID640768665 
ProductHsdR family type I site-specific deoxyribonuclease 
Protein accessionYP_001305656 
Protein GI150020302 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCG GCGTAAACGA ATTAGAGAAA AAGACTCAGG AGAGAGTTAT AAACTTCTTT 
GTTGAGGTTC TTGGTTATGA TTATCTTGGG AACTGGGAGA AGAGGGAAGG AAACTCAAAT
ATTGAGAAGG AATATCTTAT CAATTTTCTA AGAAAGCAGG GATACAATGA TGAAGTTATA
AAAAAAGCAA TTGAAAAGCT GAAGAAAATA TCCGAGAACC AGCAAAAATT GCTCTATGAT
AGAAATAAAG AAATTTACGA ACTTTTAAGA TATGGAACAG ATGTAAAAAT TCATCCTTCC
GAGCCATCCA TACATGTCAA TTTCATAGAT TGGGAGAATC CAGAGAATAA TCATTTTACA
ATAGCCGAGG AGGTAACAAT AAGGGGAAGA CATACCAAGA GACCTGACAT AGTTCTATAT
CTAAACGGGA TCGCCATCGG AGTTTTGGAA CTGAAAAGGT CAAGTATTTC CATAAGCGAG
GGAATACGGC AGAACCTTGA CAACCAGAAG AAGGAGTTCA TTCAAGATTT CTTCACCACC
GTACAGTTAA TCATAGCTGG AAACGATACG CAGGGTTTAA GATATGGAAC GATAGAAACA
CCGGAAAGGT ACTATCTTGA ATGGAAAGAG GAGAATCCGG AGTATGATCA TAACAAAAAA
ACAACCATAC CCAGAGCACT ACCGAGGGAT AAATGCGAAG TATCCGATAA TATCCTTGAC
TGCGATATTT ACAGGCTTTT AAATAAAAAT AGGCTCTTGG AAATTATACA CGATTTTATA
ATTTTTGATG CTGGTATCAA AAAGGTTCCA AGACACAACC AGTATTTTGC TGTAAAGGCA
GCCCAGAAAA GCATAAAAAA GAGAGAAGGT GGCATAATTT GGCATACTCA AGGAAGCGGG
AAATCGCTCA TAATGGTGTG GCTTGCAAAA TGGATTATAG AGAATATTTC AGACAGTAGA
GTTTTGATTG TTACCGATAG AATAGAGTTA GACGAGCAGA TTGAAAAAGT ATTCAAGGGT
GTAGGGGAAA ATATATATAG AACAAAATCA GCAAGGGATT TAATTGAACA ACTAAACAGA
AAAGATGAGA ATCTTATGTG TTCATTAATA CATAAATTCG GAAAAGCAGA GGCAACGGAC
AAGGATTATG ATGAATATAT CCATGAACTA AAGGTAACAC TCCCACGTGA TTTTAAGCCT
AAGGGTAACA TATTTGTTTT TGTGGATGAA TGCCATAGAA CTCAATCAGG TAAGCTACAT
AAAGCCATGA AAGAAATTTT ACCCAATGCT GTTTTTATTG GATTTACAGG AACGCCACTC
CTCAAAAAAG ATAAACCTAC AACCTTGGAA ACATTTGGTA AGTATATCCA TACATACAAG
TTTGATGAAG GTGTCCAAGA CGGTGTAATC CTTGATTTGA GGTATGAAGC GAGGGATGTG
GATATCAAAG TCGTGTCAGA GGATAAAATA GATAAGTGGT TTGAAGTAAA AACAAAAGGC
TTGTCAGATA TTGCCAAGGC TCAACTTAAA GAGAGATGGG GGACATTGAA GAAACTTTAC
AGTTCGAGAA GTAGGATGGA GAAGATCGTT GCGGACATAA TCTTCGATTT TGAAACAAAA
CCAAGATTGG CAACTGGCAG GGGCAACGCC ATGCTTGTGG CAGGTAGCAT ATACGAAGCG
TGTAAGTACT ATGAACTTTT TAAGTCGAGA GGATTTGACA AAGTTGCCAT TGTAACATCC
TATCGTCCAA GTATTGTGAG CATAAAAGGG GAAGAGACCG GAGAGGAAGG AGAGGCGGAA
AACATAAAGA AGTACGAAAT TTACAAAAAG ATGATAGCCG ATTATTATAA CATCTCTGAA
GAAGAAGCAG TAAAAGATTA CCGGATAGAG AAATTTGAGA AGGATGTTAA AAAGAAATTT
ATTGAAGAGC CCGGACAGAT GAAATTGTTA ATCGTGGTAG ATAAACTTTT AACAGGATTC
GATGCGCCAA GCGCTACTTA TCTTTACATA GACAAGCCCA TGAGGGACCA TGGCTTATTT
CAGGCAATCT GTAGGGTAAA CAGGTTGGAT AATAAAGACA AAGGAGATGA GTTGGACAAG
GATTATGGTT ACATTGTGGA TTATAGGGAT CTATTCAACA GTATGGAAAA AGCCATTAAA
GATTACACAT CAGGAGCATT CGCAGAGTTT GATAAGGAAG ATGTTGAAGG ATTGTTAAAG
GATAGATTAA CCGATGCGAA AAAGAGACTT GACGAAGTTC TTGAGAAAGT TGAACTATTG
GTGGAGGGCG TAAAGCCACC AAAGGGCATA AATGAGTATA GGGAATATTT TGTTGGAGAT
AACTCTGAGG AAAAGCAACA GCTCAGGTTA GAATTTTACA AGCGAGTATC TTCCCTTGTG
AGAGCTTACA CGAATATAGC TAACGAATTA ACCGAAGCAG GATATTCCGA GAGGGAACAA
AAAGAAATCA AGGAAAAAGT CAGGCATTAC GCAGCAATAA GAGACGAGCT AAAACTGATG
AGCGGCGATT ACTTAGATTT AAAGGTGTTT GATCCTGCAA TGAGGTATCT TATAGACTCT
TACATCCAAG CGGAGGAAAG CAGAACATTA GCATCGTTCG AAGAAACATC ATTGCTGGAA
ATAATAGCAC TTAATGGTCT TCAGAAGGCT TTAGAAACAT TACCAAGTGC CATCGGAAAG
AACAAGGAAG CACTTGCTGA AACAATAGAG AACAATATAA GGAAGTTGAT AGTAGACAAG
AGAGATATTA ATCCGGCTTA TTACGACAAA ATGTCCAAAA TACTTAAGGA ACTGGTAGAA
AAAAGGAAAA AAGAAGTTAT CAGTTACGAG GAGTACCTAA AGGAAGTTGA AAAATTAACC
AAGATGCTCA TAAATAGGGA ATTTAATGAA GAACCTTACC CCCCAAGCAT AAGAAGCAGT
ATGGCGAAAA AGGCTATTTA TGACAACTTA GAAGGCATTG AAAATAGAGA GAAACTTACA
AATGCTATTG ATGAAGCTGT AAGAAAAACC AAAAAAGATG ACTGGAGAGG GCATAAGATA
AAAGAGCGAC AGGTAAGAAA TGCCATTAAG AAAATCCTAG AAGATGAGGG TCTCGAGGAA
CTCACTGAAA AAATCTTTCA GATAGTGTTA AAACAGGGTG AGTATTAA
 
Protein sequence
MSIGVNELEK KTQERVINFF VEVLGYDYLG NWEKREGNSN IEKEYLINFL RKQGYNDEVI 
KKAIEKLKKI SENQQKLLYD RNKEIYELLR YGTDVKIHPS EPSIHVNFID WENPENNHFT
IAEEVTIRGR HTKRPDIVLY LNGIAIGVLE LKRSSISISE GIRQNLDNQK KEFIQDFFTT
VQLIIAGNDT QGLRYGTIET PERYYLEWKE ENPEYDHNKK TTIPRALPRD KCEVSDNILD
CDIYRLLNKN RLLEIIHDFI IFDAGIKKVP RHNQYFAVKA AQKSIKKREG GIIWHTQGSG
KSLIMVWLAK WIIENISDSR VLIVTDRIEL DEQIEKVFKG VGENIYRTKS ARDLIEQLNR
KDENLMCSLI HKFGKAEATD KDYDEYIHEL KVTLPRDFKP KGNIFVFVDE CHRTQSGKLH
KAMKEILPNA VFIGFTGTPL LKKDKPTTLE TFGKYIHTYK FDEGVQDGVI LDLRYEARDV
DIKVVSEDKI DKWFEVKTKG LSDIAKAQLK ERWGTLKKLY SSRSRMEKIV ADIIFDFETK
PRLATGRGNA MLVAGSIYEA CKYYELFKSR GFDKVAIVTS YRPSIVSIKG EETGEEGEAE
NIKKYEIYKK MIADYYNISE EEAVKDYRIE KFEKDVKKKF IEEPGQMKLL IVVDKLLTGF
DAPSATYLYI DKPMRDHGLF QAICRVNRLD NKDKGDELDK DYGYIVDYRD LFNSMEKAIK
DYTSGAFAEF DKEDVEGLLK DRLTDAKKRL DEVLEKVELL VEGVKPPKGI NEYREYFVGD
NSEEKQQLRL EFYKRVSSLV RAYTNIANEL TEAGYSEREQ KEIKEKVRHY AAIRDELKLM
SGDYLDLKVF DPAMRYLIDS YIQAEESRTL ASFEETSLLE IIALNGLQKA LETLPSAIGK
NKEALAETIE NNIRKLIVDK RDINPAYYDK MSKILKELVE KRKKEVISYE EYLKEVEKLT
KMLINREFNE EPYPPSIRSS MAKKAIYDNL EGIENREKLT NAIDEAVRKT KKDDWRGHKI
KERQVRNAIK KILEDEGLEE LTEKIFQIVL KQGEY