Gene Rru_A3684 details

Gene Information

Locus tag: Rru_A3684
Symbol:
ID: 3837140
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 4231018
End bp: 4232646
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 72%
IMG OID: 637827808
Product: sulfotransferase
Protein accession: YP_428765
Protein GI: 83595013
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
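
The start, end, gene length and protein length listed for this gene are mutually consistent, which can be checked with a short sketch. It assumes inclusive 1-based coordinates and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4231018, 4232646)
print(length, protein_length(length))  # 1629 542
```

Both values match the Gene Length and Protein Length fields of the record.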


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.843363
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGGCC ATCACGCGGC GGCGCGCGCC TTGCGCCGTC AGGGGCGCCT AGGCGAGGCG 
ATCGCCCGCT ATCGCGCGGC CCTGGCCGCC ACCCCGGGCG CGGCCGTTCT CCAAGCCGAA
ATGGCCGAAT GCCTGTTCGC CGCCGGTCAG GCCGAAGCGG CGGTGAAGGC GCTGGGTCAG
GCGGTCCGGC TCGACCCCGA TAACGCCACC CACCGCGGCA ATCTGGCGAT GCTGCTGGCC
CGCAAGGGCG ATCTGGCCGG CGCCATTGAT CATGCCCGGG CGGCGGTGCG CCTCGCCCCC
AAGGACGGCG CCTTGCGCCT GCGTCTGGCC CGTCTGCTGA TCGCGGCCGG CCATCCCCGC
GAGGCCGAGG CCGAGGCCCT GGACGCCACC CGCCGTCTGC CGCGCGAGGC CGCCAGTTGG
ATTGCCCTGG CCGGAGCCCG GCTGCTTGAC CAGCGCCCCA ATGACGCCGA AGCGCCCAGC
GCCCGCGCCC TAGCCCTGGC CCCCAATGAC GCCGAGGCGC TCAGCCTGCG CACCGATCTG
CTGCTCAGCC TCGGCCGTCT GGCCGAGGCC GAGGCGACGG CCCGCCTTGC CTTGAAGCGG
CAACCCACGT CGCTGGCCGC CCTGGTCGCG CTGTCGAAAG CCAAGACCTT CCGCCCCGAC
GACCCGGACT GGCCGGCCCT GGCGGCCCTG CTGCCCAGCC TGGGCGAGAG GAGCGCCGAA
GAGGCGGTAA AGCTGCATTT CGCCAGCGCC AAGGCGCTGG AGGACATGGG CCGCGACGAC
GAGGCCTTCG CCCATTATCA GGCGGGCAAC AGCCTGAAGG GCAGGGGCCT GCCCGATGAA
CTGCCGGCGC TGAGCGCCAT GGTCGACAGC CTGGAACGCT GGACGCCCAA GCTGCGGCCG
GTCGGCGAGG GCGATCCTTT GCCGGTGTTC ATCGTCGGCA TGCCGCGCTC GGGCACCACG
CTGGTCGAAC AGATCCTTGA CCGCCACCGG GCGATCCATG GCGCCGGCGA GATCTTGCTG
TTTGGCGAAA GGGTGGTCGC CAACGGCCTG GGCGGCTATA GCGCCGATCC GCAAGGCCTT
GATCCCGAGC GGCTAGCGGC TTTGGGGGCG GATTATCGCG ACCGCCTGCG CGGCCTCGCC
CCCCGGGCCG GGCGCATCAT CAACAAGACC CCGGGTAACT GGCTGCATCT GGGGCTGATC
GCCGCCGCCC TGCCCGGCGC CAGGATCATC TGGTGCCGGC GCGATCCCGT CGATTGCTGC
CTGTCGTGCT TTCGCAATCT GTTCGGCCAG GGCCACGCCT GGACCACCGA TCTGGGGCGG
GCCGGGCGCT ACTATCGCCT TCAAGAGCGG CTGACCGGCC ACTGGCAGGC CGTGCTGGGC
GATGAGCGCA TGACCGCCGT TGATTACGAG GCCCTGGTCG CCGACCCGGA GGCCGAGGCC
CGCCGGCTGG TCGCCCACCT TGGCCTGGAG TGGGACGAGG CCTGCCTTGA CCATACCCGG
GGCGGGCGGG CGGTCACCAC CTTGTCCCAG GTCCAGGTCC GCCAGCCGAT CACCGACGCC
TCCGTCGGTC GCGGCCGGCG GTTCCAGACC CACCTCGGGC CGCTTCTCAC CGCCCTGGAC
GGGCGCTGA
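
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. A minimal sketch of doing that and computing a GC fraction, comparable to the GC content field of the record, might look as follows. The helper names are hypothetical, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, used only as a demo.
example = clean_sequence("ATGCAGGGCC ATCACGCGGC")
print(len(example), gc_fraction(example))  # 20 0.7
```

Passing the full gene sequence through `clean_sequence` first gives the value for the whole gene.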
 
Protein sequence
MQGHHAAARA LRRQGRLGEA IARYRAALAA TPGAAVLQAE MAECLFAAGQ AEAAVKALGQ 
AVRLDPDNAT HRGNLAMLLA RKGDLAGAID HARAAVRLAP KDGALRLRLA RLLIAAGHPR
EAEAEALDAT RRLPREAASW IALAGARLLD QRPNDAEAPS ARALALAPND AEALSLRTDL
LLSLGRLAEA EATARLALKR QPTSLAALVA LSKAKTFRPD DPDWPALAAL LPSLGERSAE
EAVKLHFASA KALEDMGRDD EAFAHYQAGN SLKGRGLPDE LPALSAMVDS LERWTPKLRP
VGEGDPLPVF IVGMPRSGTT LVEQILDRHR AIHGAGEILL FGERVVANGL GGYSADPQGL
DPERLAALGA DYRDRLRGLA PRAGRIINKT PGNWLHLGLI AAALPGARII WCRRDPVDCC
LSCFRNLFGQ GHAWTTDLGR AGRYYRLQER LTGHWQAVLG DERMTAVDYE ALVADPEAEA
RRLVAHLGLE WDEACLDHTR GGRAVTTLSQ VQVRQPITDA SVGRGRRFQT HLGPLLTALD
GR
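
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own pipeline. It builds the codon table from the standard compact encoding, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG. All names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGCAGGGCC ATCAC"))  # MQGHH (first five residues)
print(translate("GGGCGCTGA"))         # GR (last two residues, then stop)
```

The two demo inputs are the first 15 and last 9 bases of the gene sequence, and the outputs match the start and end of the protein sequence shown above.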