Gene Rsph17029_3227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3227 
Symbol 
ID4898582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp285937 
End bp287664 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content59% 
IMG OID640113826 
Productrestriction modification system DNA specificity subunit 
Protein accessionYP_001045096 
Protein GI126463983 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCG ATCAGCTTTT GCGCCTTTAC GATCGGGTTT GCGAGGCCGA GGACGCCGTC 
CCCCGCCTGC GCCGGCTTGT CCTCGATCTG GCCGTGCGCG GCAAACTGGT GCCGCAAGAC
CCGGAAGATG AACCGGCGGC CGAGTTGCTG AAGCGGATCG AGAAGGAAAA GGCCCGGCTG
GTGAAGGCGG GGGAGATCCG GAAGCCGAAA TCCATAGAAA GCCCGGATGA CGCCCCTTTT
AACCTTCCGG CGAACTGGGC ATGGAGCAAT ATCGCATCTC TTGGTTCCGT CTCGCCTCGT
AACGAAGCCG AAGATGACGC AATGGCGTCA TTCGTGCCCA TGACTCTTAT TCCGACCGAA
ATCAGGGCGG CAAATGGACA TGAGCCGCGT CATTGGCGTG AGATTAAGAA AGGCTACACT
CATTTTGCAG AAGGCGATGT AGGGCTGGCG AAGATAACGC CCTGTTTCGA GAATGGCAAA
TCTGCCGTAT TTCGCGGTCT GACAGGTGGC TTCGGTGCGG GAACGACCGA GCTTCACATT
GTTCGCCCGA TCTTCGTTTC ACCAGACTAC ATTCTGACAT ACTTGAAAAG CCCGCAATTC
ATCGAAAACG GGATTCCCCG GATGACCGGC ACAGCGGGGC AAAAACGCGT TCCGACGGAG
TATTTCATCG GCACCCCTTT CCCCCTCCCA CCCCTTGCCG AGCAACACCG CATAGTCGCC
AAGGTCGAGG AACTGATGGC GCTGCTCGAC CGGATCGAGG CGGCGCGGGC GGGACGCGAG
GAGACGCGCA ACCGCCTGAC CGCCGCAACC CTTGCCCGCC TGACCGACCC CAAGGCCGAC
GCCCCCGCCG CCACCCGGTT CGCGCTGGAC ACGCTCGCCC CCCTCACCAC CCGCCCGGAC
CAGATCAAGA CCCTGCGCCA AACCATCCTC AACCTTGCCG TTCGAGGCAG GCTCGTCCCG
CAAGACCCTG CGGATGAACC GGCGTCGGAG TTGTTGAAAC GGGTATCCGT CGAACGCGCT
CGACTTGAGA AAGCAGGAGC CATCCGATCA ACGAAGCGCG CTGCATCCCT TGAAGGTACA
AAATTACGGT TCAACCCACC GCCAAGATGG CGCTGGACGA ATCTTGAATG TCTCTTTGCA
ATAACAGGGG GAATACAGAA GACACCGGGT AGAATGCCAA AAGCCAATGC GTTTCCGTAT
TTGGGCGTCG GAAATGTTTA CCGGAATCGG ATTGATCTAA CTAATCTGAA GAAATTCGAA
TTGCAAGACG GTGAAGTTGA TAAGTTCGGC CTGCAACCGT TCGACATTTT GGTTGTTGAA
GGCAACGGAA GCGCAACGGA AATCGGACGT TGTGCAATGT GGGAAGGTCA GATTGAGCAA
TGCGTTCATC AAAATCACCT GATACGGTGC CGGCCCATTG ATCCGAACCT GTCGCGATAT
GCGTTGCTCT ACCTCAATTC CCCGCTTGGA ATGGACGAGA TGACGGAGTT GGCCATCACA
TCGGCAGGTC TTTACAACCT CAGCGTAGGC AAAATCTCAA CAGTTCCCCT TCCCCTCCCG
CCCCTCGCCG AACAGCACCG CATCGTGGCC AAGGTCGACG CGCTGATGCG CCTCCTCGAC
GATCTCGAGG CAGCGCTGAG CGCCTCGTCC ACCACCCGCG CCCGCCTCCT CGACGCCACC
CTGCGCGCCG CGCTGGCCGA GCCGGGGCAC ACGAGGGCCG CCGCATGA
 
Protein sequence
MNADQLLRLY DRVCEAEDAV PRLRRLVLDL AVRGKLVPQD PEDEPAAELL KRIEKEKARL 
VKAGEIRKPK SIESPDDAPF NLPANWAWSN IASLGSVSPR NEAEDDAMAS FVPMTLIPTE
IRAANGHEPR HWREIKKGYT HFAEGDVGLA KITPCFENGK SAVFRGLTGG FGAGTTELHI
VRPIFVSPDY ILTYLKSPQF IENGIPRMTG TAGQKRVPTE YFIGTPFPLP PLAEQHRIVA
KVEELMALLD RIEAARAGRE ETRNRLTAAT LARLTDPKAD APAATRFALD TLAPLTTRPD
QIKTLRQTIL NLAVRGRLVP QDPADEPASE LLKRVSVERA RLEKAGAIRS TKRAASLEGT
KLRFNPPPRW RWTNLECLFA ITGGIQKTPG RMPKANAFPY LGVGNVYRNR IDLTNLKKFE
LQDGEVDKFG LQPFDILVVE GNGSATEIGR CAMWEGQIEQ CVHQNHLIRC RPIDPNLSRY
ALLYLNSPLG MDEMTELAIT SAGLYNLSVG KISTVPLPLP PLAEQHRIVA KVDALMRLLD
DLEAALSASS TTRARLLDAT LRAALAEPGH TRAAA