Gene Rru_A3541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3541 
Symbol 
ID3836996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp4073757 
End bp4076546 
Gene Length2790 bp 
Protein Length929 aa 
Translation table11 
GC content70% 
IMG OID637827664 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_428622 
Protein GI83594870 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTATTTT TCGCTGATCG AAGCCCCTTC GTCCGTCCTC TGTCAGGATA TGCGCCTGTG 
ACGCCCGCCG CTCCCCGTTC CACCGGTTCC GCCGCCGCCC CGCCGCCCTC CCCCGCCGTC
CTTGATGACC GGAGCGGAAC GGAGGGCGAT GTCACGCCGA TGATGGCCCA GTATCTGGCC
GTCAAGGCCG CCCACCCCGA TTGCTTGCTA TTCTATCGCA TGGGCGACTT CTACGAGATG
TTTTTCGAGG ACGCGGTCAA GGCCGCCGAG ACGCTGGATA TCGCCCTGAC CAAGCGCGGC
CGGCACGCCG GGGCGGATAT TCCCATGTGC GGCGTGCCCA TCCACTCCCA CGAGGGCTAT
CTGTCGCGGC TGATCCGCGC CGGCATCAAG GTGGCGATCT GCGAGCAGAT GGAAGATCCC
GCCGAGGCCC GGCGCCAGCG CGGCTATAAG GCGGTGGTGC GCCGCGACGT GATCCGCGTG
GTGACCGCCG GCACCCTGAC CGAAGACGAA CTGCTTGATG CCCGCGCCCA TAATTATCTG
GCCGCCGTCG TCCGTTTGCG CGACGCGGTC GGCATGGCCT GGGTCGATGT CTCGACCGGC
GATCTGGTGG CCCAGCCGCT GGCCGAGGCC GATATCGGAC CGGCCCTGGC CCGCCTCGCC
CCGGGCGAGG TGCTGATGCC CGAAAAGCTG GCGGGCGATC CGGCGCTGCG CGAGATCCTG
GCGCCCTTGG CCGGGCGGAT CAGCCCGCTG CCGGCCAGCC GCTTCGATAG CGAAAACGCC
CGCAAGCGGG TGGAGGGCCT GTTCGGGGTC AAGGCGCTCG ATGGCTTCGG TGGCTTCGGC
CGGGCCGAGG TGGCGGCGAT CGGCGCCTTG ATCGATTACG TCGAACTGAC CCAGGTCGGC
CGCCTGCCCC GGCTGTCGCC GCCGCGCCGG CTGTCGCTTG GCGCCATCCT TGAAATCGAC
GGGGCGACCC GGCGCAATCT GGAACTGACC GAAACCCTGG GCGGCGGCCG CAAGGGCAGT
CTGCTCGCCC GCATCGATTG CACGGTGACC GGGGCCGGGG CGCGGCTGCT GGCCGAGCGC
TTGGCCGCGC CGCTGACCGA TCCCGCGCAG ATCGGCGCCC GCCTTGATGG CGTCGGCTTC
CTGGTCAGCG CCGAGCGGGT GCGCGGCGAT CTGCGCGACA CCTTGCGCGG TTGTCCCGAT
ATCGCGCGCG CCCTGTCGCG GCTGTCGTTG GGGCGCGGCG GTCCGCGCGA TCTCGCCGCC
ATCGGCGAGG CGCTGTCGCG CATTCCGGCG CTGCGCGTGC TGGTGGTCGG CGCCGGCCTG
GGCGAGCCGC CGACCGAACT GACCGCCGCC TTGATCGATC TGGGCAGCCA CGAGGGGCTG
GTCGATCTGC TGGGCCGGGC CCTTGATGCC GACCTGCCGC TGCTGGCGCG CGATGGCGGT
TTCATCCGCC CGGGCTATGA CGCCGGGCTT GATGAACTGC GGGCGCTGCG CGACGAGGGC
CGGCGGCTGA TCGCCGGTCT GCAGGCGCGC TATGCCAGCG AAACCGCCAT CCCGGCGCTG
AAGATCAAGC ATAACAACGT GCTGGGCTAT TTCATCGAGG TCGCCGCCGG TCGCGCCGAC
AAGCTGATGG CCGCCGGCGG CCCCTTCCTC CACCGCCAGA CCCTGGCCTC GCAGGTGCGC
TTCACCACGG TGGAATTGTC CGAACTGGAA GACAAGATCC GCGGCGCCGC CGATAAGGCC
CTGGCCCTGG AACAGGCGCT GTTCGCCACG TTGTGCGCCG AGGTTCTGGG CTGCGCCGCC
GACATCGCCC GCGCCGCCAA CGGGCTGGCC TGCCTGGATG TCGCCGCCGC CCTGGCCGAT
CTGGCGGCGC GCGAGCGCTA TGCCCGGCCG GTGGTCGATA ACTCCACCGC CTTTCGCATC
CACAAGGGCC GCCATCCGGT GGTCGAGGCG GCTTTGGCCG ATCAGGCCGG CCCGGCCTTC
GTCGCCAATG ATTGCGACCT CGCCCCCGAC CAGCGGCTGT GGCTGCTGAC CGGCCCCAAT
ATGGCCGGTA AATCCACCTT CCTGCGCCAG AACGCCCTGA TCGCCGTGCT GGCGCAGATG
GGATCTTTCG TGCCCGCCGA ATCGGCCGAG ATCGGCGTGA TCGACCGGTT GTTCTCGCGG
GTGGGGGCGG CCGACGATCT GGCGCGCGGG CGCTCGACCT TCATGGTCGA AATGGTGGAG
ACCGCCGCCA TCCTCAATCA GGCCACCGAA CGCTCGCTGG TGATCCTTGA CGAGATCGGT
CGCGGCACCG CCACCTATGA CGGGCTGTCG ATCGCCTGGG CCACGGTCGA GTCGCTCCAC
GACGCCACCC GCTGCCGGGC GCTGTTTGCC ACCCATTACC ACGAACTGAC GGCGCTGGCC
TCGCGCCTTG ACCGGCTGTC GTGCCACACC TTGCGCATCA AGGAGTGGAA GGATCAGGTG
GTCTTCCTGC ACGAGGTCGG GCCCGGGGCG GCCGACCGCT CCTATGGCAT CCATGTCGCC
AAGCTGGCCG GGCTGCCCGC CGCGGTGATC GCCCGGGCCG AACAAGTGCT GGCGATCTTG
GAAAAGGGCG ATGCGTCGAG CGCGGCGACG CGGCTGGCCG ATGACCTGCC GTTGTTCGCC
GCCGCCCGCC CGCGTGCCGG CCTTCCCACC CCGCCGCCCG GCCCCCACCC CCTGGCCGAG
GCCCTCAACG CGATCAACCC CGACGAAATG ACCCCGCGCG AGGCCCTTGA CGCCCTTTAC
CGGCTGAAGG CGGTGATGAA GCGGGAGTAG
 
Protein sequence
MLFFADRSPF VRPLSGYAPV TPAAPRSTGS AAAPPPSPAV LDDRSGTEGD VTPMMAQYLA 
VKAAHPDCLL FYRMGDFYEM FFEDAVKAAE TLDIALTKRG RHAGADIPMC GVPIHSHEGY
LSRLIRAGIK VAICEQMEDP AEARRQRGYK AVVRRDVIRV VTAGTLTEDE LLDARAHNYL
AAVVRLRDAV GMAWVDVSTG DLVAQPLAEA DIGPALARLA PGEVLMPEKL AGDPALREIL
APLAGRISPL PASRFDSENA RKRVEGLFGV KALDGFGGFG RAEVAAIGAL IDYVELTQVG
RLPRLSPPRR LSLGAILEID GATRRNLELT ETLGGGRKGS LLARIDCTVT GAGARLLAER
LAAPLTDPAQ IGARLDGVGF LVSAERVRGD LRDTLRGCPD IARALSRLSL GRGGPRDLAA
IGEALSRIPA LRVLVVGAGL GEPPTELTAA LIDLGSHEGL VDLLGRALDA DLPLLARDGG
FIRPGYDAGL DELRALRDEG RRLIAGLQAR YASETAIPAL KIKHNNVLGY FIEVAAGRAD
KLMAAGGPFL HRQTLASQVR FTTVELSELE DKIRGAADKA LALEQALFAT LCAEVLGCAA
DIARAANGLA CLDVAAALAD LAARERYARP VVDNSTAFRI HKGRHPVVEA ALADQAGPAF
VANDCDLAPD QRLWLLTGPN MAGKSTFLRQ NALIAVLAQM GSFVPAESAE IGVIDRLFSR
VGAADDLARG RSTFMVEMVE TAAILNQATE RSLVILDEIG RGTATYDGLS IAWATVESLH
DATRCRALFA THYHELTALA SRLDRLSCHT LRIKEWKDQV VFLHEVGPGA ADRSYGIHVA
KLAGLPAAVI ARAEQVLAIL EKGDASSAAT RLADDLPLFA AARPRAGLPT PPPGPHPLAE
ALNAINPDEM TPREALDALY RLKAVMKRE