Gene Rru_A3333 details

Gene Information

Locus tag: Rru_A3333
Symbol:
ID: 3836785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 3845680
End bp: 3846930
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 67%
IMG OID: 637827454
Product: SAM-dependent methyltransferase
Protein accession: YP_428414
Protein GI: 83594662
COG category: [R] General function prediction only
COG ID: [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:
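
The coordinates and lengths in this record are consistent with each other. The following is a minimal sketch of that arithmetic, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 3845680
end_bp = 3846930

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1251
print(protein_length)  # 416
```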


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.427413
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAT GCCCTATCCT GGGGCCTCCT CCCCCTTTCG GAATCGTCAT GACAGAGCCA 
AATTCGCCGT CTCTCCCCGA AATCCGCCTT CTCGCCGGCC ACTCGAAGCG GCTCCGCCAG
GGCCATCCCT GGGTTTTCTC CAACGAGATC GCCATGACCC CGGAGGCCAA GGCCATGACT
CCGGGAGCGC TTGTCACCTT GCGCGATGCC GGGGATGAGC CGCTGGCCAT CGCCACGTTT
AATCCGCATT CGCTGATCGC CGCGCGGGTT CTTGATCGCG ATCTTGGCGC CAGCATCACC
AAGGAATGGG TCTTCGTCCG GCTTCAGCGG GCCTTGCGCC TGCGCGACAC GCTGTTTGAC
CAACCCTATT ACCGGCTCGT CCATGGCGAG GCCGACGGCC TGCCCGGGCT GGTCGTCGAC
CGTTTGGGCG ATGTCATCGC CGTCCAGGCC AACAGCGCCG GGATGGATCT GCTGACGCCA
CTGATTCTCG ACGCCATCGA AAACCTGCTG GCGCCGCGCG CCATCGTGCT GATCAACGAC
GCGCCGGTGC GCCTACTCGA AGGGCTGACC CAGGAAACCG CCCTGGCCCG GGGCCAGATC
GACGGCCCGG TGCCGGTGAT CGAAAACGGC TTTTCCTATC TGGCCGACCT GCAGGAAGGC
CAGAAGACCG GCTGGTTCTT CGATCAGCGG CCCAATCGCG CCTTCGTCGC CGACCTCGCC
CGCGGCCGCT CGGTGCTTGA TGTCTATAGC TATGCCGGGG GCTTCGGGCT GCTGGCGCTG
GCGCGCGGCG CCACCTCGGC CACCCTGGTC GACCGCTCCG ACCAGGGCCT GCTGCTGGCC
CAACAGGCCG CCGCCACCGC CGGACTCGGC GGGGCGCTGA CCACCCACAA GGCCGAGGGT
TTCGCCTACC TCGAGCAAGC AGAGCATGAG GGCAAGCGTT TTGGCGTCGT CGTCTGCGAT
CCGCCGGCCT TCGCCAAGAC CCGCAAGGAT CAGGCCTCGG GCGCCAAGGG CTATCGCAAG
GTCGCCCGCC TCGCCGCCGC CCTGGTCGAA CCCGGCGGCT TTCTGTTCGT CGCCTCGTGC
AGCCATCACA TGCCGATCGA CCGCTTCCAG GACGAAACCG CCCATGGCAT CGCCCAGGCC
GGGCGGACGG GGCGGATCTT GCGCTCGGGC GGCGCCGGTC CCGACCATCC GGTCCATCCC
GATCTGGCCG AATCGGCCTA TCTCAAGACC CTGACCTGGG CGATCGACTA A
 
Protein sequence
MRKCPILGPP PPFGIVMTEP NSPSLPEIRL LAGHSKRLRQ GHPWVFSNEI AMTPEAKAMT 
PGALVTLRDA GDEPLAIATF NPHSLIAARV LDRDLGASIT KEWVFVRLQR ALRLRDTLFD
QPYYRLVHGE ADGLPGLVVD RLGDVIAVQA NSAGMDLLTP LILDAIENLL APRAIVLIND
APVRLLEGLT QETALARGQI DGPVPVIENG FSYLADLQEG QKTGWFFDQR PNRAFVADLA
RGRSVLDVYS YAGGFGLLAL ARGATSATLV DRSDQGLLLA QQAAATAGLG GALTTHKAEG
FAYLEQAEHE GKRFGVVVCD PPAFAKTRKD QASGAKGYRK VARLAAALVE PGGFLFVASC
SHHMPIDRFQ DETAHGIAQA GRTGRILRSG GAGPDHPVHP DLAESAYLKT LTWAID
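
The protein sequence can be checked against the gene sequence by translating it codon by codon, and the GC content can be recomputed from the same text. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last lines of the gene sequence are used as input here.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed here to apply to
# translation table 11 for all non-start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, as displayed in the record.
first_line = "ATGAGAAAAT GCCCTATCCT GGGGCCTCCT CCCCCTTTCG GAATCGTCAT GACAGAGCCA"
last_line = "GATCTGGCCG AATCGGCCTA TCTCAAGACC CTGACCTGGG CGATCGACTA A"

print(translate(first_line))  # MRKCPILGPPPPFGIVMTEP
print(translate(last_line))   # DLAESAYLKTLTWAID
```

Passing the complete gene sequence to `translate` and `gc_fraction` in the same way would allow the full protein sequence and the listed GC content to be compared with the record.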