Gene Tcr_0334 details


Gene Information

Locus tag: Tcr_0334
Symbol:
ID: 3762359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thiomicrospira crunogena XCL-2
Kingdom: Bacteria
Replicon accession: NC_007520
Strand:
Start bp: 376764
End bp: 379991
Gene Length: 3228 bp
Protein Length: 1075 aa
Translation table: 11
GC content: 40%
IMG OID: 637785043
Product: HsdR family type I site-specific deoxyribonuclease
Protein accession: YP_390604
Protein GI: 78484679
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID: [TIGR00348] type I site-specific deoxyribonuclease, HsdR family
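
The start, end, gene length, and protein length listed above agree with one another, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length(376764, 379991)
print(length)                  # 3228
print(protein_length(length))  # 1075
```

Both results match the Gene Length and Protein Length fields of the record.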


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACCA TGACCGAAGA TACCCTTGTT CAGCAAACGA CTGCCAGCTA TTTGGTCGAC 
ACTCTTAAAT GGGATGAATC GATTAATGGC ATGTATGAAA AGCTGGGTAA AGAAGGGTCT
TTGGGGCGTG AATCTGAAAA AGAGATGGTA TTAACCCGTT ATTTAGGCGA GGCCTTAATG
CGGTTAAATC CTGGCTTTCC TATGGAGGCA TATCAAGATG CCATTCGGCA AATTATAGAA
GTGAACTTGT CTCTGTCTTT ATTGGGGATT AACCAAGATA AGGACAAACT TTATAAAAGT
CGTGTACAGG TCTCTTTTAA GAATGAAAAA GGCGAGCGTA AACAGAAGTC ATTAAAAATC
TTTGATTTTG ATAACCCAGA TAACAACCAC TTTTTAGTGG TGCGTGAGCT TTGGGTGCAA
GGTGATATAT ATCGTCGTCG TGCTGACTTG GTCGGCTTTG TAAATGGTAT TCCATTGGTG
TTTATGGAGG TTAAAAACCT GCATAAAGAA GTTAAAGCCG CTTATGAGCA AAACTTTAAA
GATTATCTAG ATACCGTACC GCATTTGTTT CATCACAATG CGTTTGTGAT TTTGGGGAAT
GGGGTAGATG CCAAGATTGG TTCAGTCTCA TCTAAGTTTG AACACTTTAA CGATTGGAAG
CGTTTGCACG AAGAAGAGCC TGGCGTTGTG AATATGGAAA CCTTGTTAAA GGGTGTGTGC
AATAAAGCCA ATTTACTCGA TCTGTTTGAG AACTACACCA TCTTTGATGA CTCTAGCGGC
AAACTAATCA AAATTGTGGC ACGTAACCAC CAATTTTTAG GGGTGAATCG TGCGGTGGAT
GCAGTTAAAA ACCGTGAAAG TTTAGCGGGT AAGCTTGGGG TTTTTTGGCA TACTCAAGGA
TCGGGAAAAT CTTATTCTAT CGTTTTCTTC GCCAAAAAAG TTCACCGTAA ATTAGGCGGT
AACTACACCT TTATGGTGCT GACAGATCGC GAAGATTTAG ACAGCCAAAT TTATAAAACT
TTTGCCGGTA TAGGGTTGGT GGATAACGAT AAAGATCCTT GCCGAGCTTC AAGTGGTAAA
CATTTAGATG AATTGCTTGG TCAACATAAG TCTTATGTGT TTTCGTTGAT TCAAAAGTTC
AATGACCAGA ATATGACAGG TTCTGATCGT GATGACATTA TTGTTATAAC GGATGAGGCG
CATCGTACCC AGTATGGGTC TTTGTCTTTA AACATGCGAA ATGCGTTACC GAATGCGAGC
TTTATTGGCT TTACTGGAAC CCCGCTATTA AAAGATGATG AGATTACTCG CCGCGTCTTT
GGTGACTATG TTTCAACTTA TGACTTTCAA CGTGCCGTAG AAGATAATGC GACTGTGCCG
CTTTATTATG ATGCACGTGG TGATAAGTTG GTTTTTATTA ATGATGCAGG TGAAGAGTCT
AAGGTGGCTG ATCCTAAAGG CATTAATGAG AAGATTGCCG AGAAGCTAGA AGAGTTAGAA
ATTGATGATG TTGATGTAGC TCAACGTTTG GAAAGAGAGT TAAAACGAGA TTATCACATT
TTAACTTCTA GTAGCCGTTT AGAACAAATT GCCAAGGATT TTGTGGAGCA CTATTCCGAA
GGGTGGGAAA GCGGCAAGGC AATGTTTGTT TGCATTGATA AAGTCACCAG TGTTCGTATG
CACCAACTGA TTGATAAATA TTGGCAAGAA CGCATTACCT ATCTCCAAAA ACAGATTGAT
GGGGTGGCTT ATGATGAGCA AGAAACCATT TTTAGAGAAC GCCAACTTTC TTGGATGAGA
GAAACACAAA TGGCGGTGGT CGTCAGTGAA GAGCAAGGTG AAGTTGAGAA ATTCAAGAAG
TGGGATTTAG ATATTCTGCC TCACCGTAAA CTGATAAAAG AAGGTTTTAT TGGTAGTGAT
GGTAAACGCA TTGATCTAGA ACAAGCGTTT AAAAAAGATG AGCATCCCTT TAGGGTGGCA
ATAGTCTGTG CTATGTGGCT GACAGGGTTT GATGTACCAA CATTATCAAC CTTGTATCTA
GATAAGCCTT TAAAGGCACA TACACTGATG CAAGCCATTG CACGAGCAAA CCGTGTAGCA
GAAGGCAAGA ACAATGGCTT AATTGTCGAT TATTGTGGCA TCTTAAAGAA TTTGAGAAAA
GCCTTGGCTG ATTTTGGCGG TGCAACAGAT GAAGGTAGAA CCGATGGCGG TGAGCATGAC
CCTGCAAGAC CACAAGAAGA GTTGCTGGTT GATTTAGCTG AGTCGATTGA TTTAGTGAAA
AGCTTCTTAA AAGATCGCTA TGTTGAGCTT TCTTCCATTA CTGAATCAGA GGGTTTTGCT
CGAAATGCCT CTATTGTTGC AGCCAAAGAA GTGATTAATG AAAACGATCA AACTCGTAAA
CAATACGAAG TGACAGCTAG AGAAGTCTTT AAGAAGTTCA AAGCCTGCAT CAATATACGA
CCTGAAATTA ACGATTATCG CTCGGATCGT GATGCAATAA ATATCATCTA TCAATCGTTA
CAAGATGACC GTGAAAAGGC CGATATTAGT CATATTATTA AAGAGCTTCA CTTGATTGTT
GATGAATCGA TAGATGTCAC GATGTCTGGT TCTGAAGATA GCCATTTGTA TGATATTAGT
AAAATCGATT TTGAAAAGTT AAAACAAGAG TTTGCACGAT CCGATAAAAA GCAGACTGTA
GTTGCTAACT TAAAAGATGC GATAGCTCAG AAACTAGCTG ATATGCTTAA ACAAAACCCT
TCACGAACGG ATTTTCAAAA ACGATATGAA GAGATTGTTG ATGAATACAA CAATGAAAAA
GATCGAGTAA CAATTGAAAA GACGTTTGAG GCGTTGATGA TACTGGAAGG TGAACTGACC
GAAGAGCAGG AGCGTGCGGT TCGGGAAAAC TTGGATGAAG AATCGCTGGT GGTGTTTGAT
TTATTGAGCA AGCCTGATCT AACGCCTAAA GACATTAAAC TGATTAAGGC GGTGGCGGTT
GAGCTGCTTG GGAAGTTAAA AGCCGAGAAG CTTAAAGTTG ACAATTGGCG TGAAAAACAA
GCGACCCGTG ATGCCGTCAA ACAAACTATT TATGATTTCT TGTACGATGA ACGCACCGGC
TTGCCGGAAA GCTATGAAGT CGATGAGATT GAAACGGTGA CCGAAAACCT GTTTACCCAT
GTTTATAGGG TTTATCCAGC CGTGCCGTCT CCAGTGTATG CATATTAG
 
Protein sequence
MNTMTEDTLV QQTTASYLVD TLKWDESING MYEKLGKEGS LGRESEKEMV LTRYLGEALM 
RLNPGFPMEA YQDAIRQIIE VNLSLSLLGI NQDKDKLYKS RVQVSFKNEK GERKQKSLKI
FDFDNPDNNH FLVVRELWVQ GDIYRRRADL VGFVNGIPLV FMEVKNLHKE VKAAYEQNFK
DYLDTVPHLF HHNAFVILGN GVDAKIGSVS SKFEHFNDWK RLHEEEPGVV NMETLLKGVC
NKANLLDLFE NYTIFDDSSG KLIKIVARNH QFLGVNRAVD AVKNRESLAG KLGVFWHTQG
SGKSYSIVFF AKKVHRKLGG NYTFMVLTDR EDLDSQIYKT FAGIGLVDND KDPCRASSGK
HLDELLGQHK SYVFSLIQKF NDQNMTGSDR DDIIVITDEA HRTQYGSLSL NMRNALPNAS
FIGFTGTPLL KDDEITRRVF GDYVSTYDFQ RAVEDNATVP LYYDARGDKL VFINDAGEES
KVADPKGINE KIAEKLEELE IDDVDVAQRL ERELKRDYHI LTSSSRLEQI AKDFVEHYSE
GWESGKAMFV CIDKVTSVRM HQLIDKYWQE RITYLQKQID GVAYDEQETI FRERQLSWMR
ETQMAVVVSE EQGEVEKFKK WDLDILPHRK LIKEGFIGSD GKRIDLEQAF KKDEHPFRVA
IVCAMWLTGF DVPTLSTLYL DKPLKAHTLM QAIARANRVA EGKNNGLIVD YCGILKNLRK
ALADFGGATD EGRTDGGEHD PARPQEELLV DLAESIDLVK SFLKDRYVEL SSITESEGFA
RNASIVAAKE VINENDQTRK QYEVTAREVF KKFKACINIR PEINDYRSDR DAINIIYQSL
QDDREKADIS HIIKELHLIV DESIDVTMSG SEDSHLYDIS KIDFEKLKQE FARSDKKQTV
VANLKDAIAQ KLADMLKQNP SRTDFQKRYE EIVDEYNNEK DRVTIEKTFE ALMILEGELT
EEQERAVREN LDEESLVVFD LLSKPDLTPK DIKLIKAVAV ELLGKLKAEK LKVDNWREKQ
ATRDAVKQTI YDFLYDERTG LPESYEVDEI ETVTENLFTH VYRVYPAVPS PVYAY
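
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before they can be measured or analyzed. The following is a minimal sketch of that cleanup, together with a GC fraction calculation and a stop codon check. It assumes plain uppercase or lowercase DNA letters with no ambiguity codes, and all helper names are hypothetical. The stop codons listed are those of translation table 11, the table given in the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the last three bases form a stop codon."""
    return len(dna) >= 3 and dna[-3:] in STOP_CODONS


# The first two blocks and the final block of the gene sequence above.
head = clean_sequence("ATGAACACCA TGACCGAAGA")
tail = clean_sequence("CATATTAG")
print(len(head), gc_fraction(head))  # 20 0.45
print(ends_with_stop(tail))          # True
```

Applying clean_sequence to the full gene sequence and passing the result to gc_fraction gives a value that can be compared with the GC content field of the record.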