Gene Rfer_1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRfer_1366 
Symbol 
ID3960520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodoferax ferrireducens T118 
KingdomBacteria 
Replicon accessionNC_007908 
Strand
Start bp1471267 
End bp1472535 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content53% 
IMG OID637916183 
Productputative type I site-specific restriction-modification system, S subunit 
Protein accessionYP_522631 
Protein GI89900160 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0493853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGACT CCGGCGCGGC GTGGATCGGG GAGATTCCGC AGGGGTGGGA AATCAAGCGA 
ATGAAAGACT GCTTTATCTC TAACAGCCGC GCCCAGCCGA ACAAGACGGT TTTGTCGTTG
AGCTACGGTA AAGTGATCGT CAAGGACATG GAGGAAAAGA AAGGCGTTAC ACCTGAAAGT
TTCGATTCAT ATCAGGGTGT ACATCCGGGC GACGTTGTAT TGCGTCTGAC TGATCTGCAA
AACGACCAGA AAAGCTTGCG TGTAGGTCGC GCAACGACCA AAGGAATTAT CACATCTGCC
TATTTGTGCG TATCGAGCCG GTCACTCAAT GATCGATATT CTGCGTATCT TCTGCATGAT
GTCGGCGACA TTCAAAAACT GTTTTATGGC TTGGGCGGTG GTGTTCGGCA ATCCATGAAA
TTTGCGGATT TGGCCGAGCT TTTGTTTTCA TTGCCAACGC CCGCCGAACA ACGCGCCATC
GCCGACTATC TCGACCGGCA AACCGCGCTG ATCGACCAGC GCCTTACCAC CCTTGCCGAA
AAGAAAGCCG TGTTGGCCGA ACTGCGCAAG GCTACTATCC ACGAGGCTGT GACCAAGGGC
TTGAACAAGA ACGCCCCGAT GAAAGATTCC GGGGTGGCGT GGATCGGGGA GATTCCGCAG
GGGTGGGAAA TCAAGCGAAT GAAAGACTGC TTTATCTCTA ACAGCCGCGC CCAGCCGAAC
AAGACGGTTT TGTCGTTGAG CTACGGTAAA GTGATCGTCA AGGACATGGA GGAAAAGAAA
GGCGTTACAC CTGAAAGTTT CGATTCATAT CAGGGTGTAC ATCCGGGCGA CGTTGTATTG
CGTCTGACTG ATCTGCAAAA CGACCAGAAA AGCTTGCGTG TAGGTCGCGC AACGACCAAA
GGAATTATCA CATCTGCCTA TTTGTGCGTA TCGAGCCGGT CACTCAATGA TCGATATTCT
GCGTATCTTC TGCATGATGT CGGCGACATT CAAAAACTGT TTTATGGCTT GGGCGGTGGT
GTTCGGCAAT CCATGAAATT TGCGGATTTG GCCGAGCTTT TGTTTTCATT GCCAACGCCC
GCCGAACAAC GCGCCATCGC CGACTATCTC GACCGGCAAA CCGCGCTTAT TGATACCCAG
CTTGCCACGC TGGACGAGCA GGCTCAGGTG TTGAAGGTAC TGCGCAAGGC CATCATCCAC
GAGGCGGTGA CCGGCAAGAT CGACCTATCC GGCTACGTGC CGCAAACTTC AGAGGCGCAG
GCCGCCTGA
 
Protein sequence
MKDSGAAWIG EIPQGWEIKR MKDCFISNSR AQPNKTVLSL SYGKVIVKDM EEKKGVTPES 
FDSYQGVHPG DVVLRLTDLQ NDQKSLRVGR ATTKGIITSA YLCVSSRSLN DRYSAYLLHD
VGDIQKLFYG LGGGVRQSMK FADLAELLFS LPTPAEQRAI ADYLDRQTAL IDQRLTTLAE
KKAVLAELRK ATIHEAVTKG LNKNAPMKDS GVAWIGEIPQ GWEIKRMKDC FISNSRAQPN
KTVLSLSYGK VIVKDMEEKK GVTPESFDSY QGVHPGDVVL RLTDLQNDQK SLRVGRATTK
GIITSAYLCV SSRSLNDRYS AYLLHDVGDI QKLFYGLGGG VRQSMKFADL AELLFSLPTP
AEQRAIADYL DRQTALIDTQ LATLDEQAQV LKVLRKAIIH EAVTGKIDLS GYVPQTSEAQ
AA