Gene Rru_A2854 details

Gene Information

Locus tag: Rru_A2854
Symbol:
ID: 3836294
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 3285287
End bp: 3287044
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 73%
IMG OID: 637826965
Product: hypothetical protein
Protein accession: YP_427938
Protein GI: 83594186
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
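
The coordinate and length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database tooling.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3285287, 3287044)
print(length, protein_length(length))  # prints: 1758 585
```

Under these assumptions the result agrees with the listed gene length of 1758 bp and protein length of 585 aa.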


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCCCC GTCTCGCCCT TGCCTTGAAG CTCCTTGAGG CCGGTCTGGT CGAAAGCGCG 
GCGATTCCCG TCGCCCAGGC CCTGGAGCGC GATCCCGACG ATCCCCGCGC CCACCATCTG
GCCGGGCTGA TCGCCCGCCG CGGCGGCGAT GGGGCCCTGG CCCTAGCGGC CTTTACCCGG
GGGCTGGCCG TGGCGGCTGA CGATGCGCCG CTGCGGATCG AGCGCGCCGG CCTGCTGCTT
GATCTTGGCC GGGCCGAGGA GGCCCTGGCC GATCTGGCGG TGGCCCTGGC CCTGGCCCCC
GATCACCAAG CGGTTCTCTC GACCCAGGGG CGGGCCCTGA TCGACCTTGG TCGGCCGGGC
GAAGCCCTTG CCCCCCTGCG CCGGGCCCGC GCCCTTGCCC CCGACGACGT GGCCCCCGCC
CATAACCTGG GGCGGGCCCT GCTGGCCCTG GATCGTCCCG ACGAGGCGCT GCCGCTTCTG
GAAGCCGCCG TCGCCCGGGC GCCGGGGGCG ATCGACCCGC GCGCCGATCT GGCCGAGGCC
TTGCGCCGTC TTGATCGGCC GGGCGATGCC GCCCAGCTGT TGCGCGCCGT CCTGGCCGAC
GCCCCCGGGC GCGCCGATTT GGCCGATGCC CTGGCCGGAG CCCTGTTGGC CGCCGGGGCG
GTCGACGAGA GCGTGGCGGT GCTGCGCGCC AGCCTGGACA AGGCGCCGAC CCATGGCGCG
GGATGGGTCA ATCTGGCCGC CGCCCTGATC GAGACCGGCG CCCTGGACGA GGCGACGGCG
GCCGCCGACC GGGCCCTGGC CCTTGATCCC GACGATGCCG ATGCGAAGGT CAACCGCGCC
TTCGCCCGCT GTCTTGCCGA TGATTATGAA CAGGGGTTCG CCGATTATGC CCATCGCTGG
CGCACGGCGG CCTTTCAGCG GCCCTATCCG CCGGTCACGG CGCCGCCATG GGCGGGCGAG
CCCTTGGGCG AGGGGACGCT GCTGGTGCGT GGCGAGCAGG GGTTGGGCGA TCAGATCATG
GCGGCGCGCT TCCTGCCCTG GCTGAGCAAG CGCCCCGATC GTCCGCGGCG GATCGTCTTT
GAATGCCACC CCTGCCTGCA TCGCTTGCTG GCGGAGGGCC TGGAGGCCGG GATCGACCTT
CTGGCCATGG GCCGCCCGCC GCCGCCCGTG GCCGCCTGGA TCGGCGCCCT TGATCTTGCC
CGCATGGCCG GGGTCACGGC CGGCGTCATG CCCGTGGATG TGCCGTATCT GAAGGCTCGG
CCGCCGGTGC CGCCCTTGGC GCCCCCGCTG CCGCCGGGGG GGCGGGGTCG GCTGGGGCTG
GTCTGGGCCG GCAAGACCCG GCCGCGCGAC CGTTCCTGTC CGCTCGAGCC TTTAGCCGGC
CTGTGTGTCG ATCAGGGATG GACGGTCCAT GCCCTGCAGT TGGGACCGCG CCGCGCCGAT
CTGGCCGGTT TGCCCGCCGG CCTGGGAGTG ATCGACGAGG GGGACCGGCT GGGCGACATG
GCGGCGACGG CGTCGGTGAT GGCCGGGCTT GATCTGGTGG TGGCCGTCGA TACGGCGGTG
GCCCATCTGG CCGGAGCGCT TGGCCTGCCT TGCGCCCTGC TGCTGCTTGC CACCCCCGAC
TGGCGGTGGG GGCAAAAGGC GTCGCGCACG GTGTGGTACC CCTCGCTCCG CCTGTTCCGC
CAGCCCCATC CCGGCGATTG GGATGGGGCC TTTGATGCGG TGCGGCGAGC CTTCGCCCAA
GGCTGGCCGC TTTCATAG
 
Protein sequence
MNPRLALALK LLEAGLVESA AIPVAQALER DPDDPRAHHL AGLIARRGGD GALALAAFTR 
GLAVAADDAP LRIERAGLLL DLGRAEEALA DLAVALALAP DHQAVLSTQG RALIDLGRPG
EALAPLRRAR ALAPDDVAPA HNLGRALLAL DRPDEALPLL EAAVARAPGA IDPRADLAEA
LRRLDRPGDA AQLLRAVLAD APGRADLADA LAGALLAAGA VDESVAVLRA SLDKAPTHGA
GWVNLAAALI ETGALDEATA AADRALALDP DDADAKVNRA FARCLADDYE QGFADYAHRW
RTAAFQRPYP PVTAPPWAGE PLGEGTLLVR GEQGLGDQIM AARFLPWLSK RPDRPRRIVF
ECHPCLHRLL AEGLEAGIDL LAMGRPPPPV AAWIGALDLA RMAGVTAGVM PVDVPYLKAR
PPVPPLAPPL PPGGRGRLGL VWAGKTRPRD RSCPLEPLAG LCVDQGWTVH ALQLGPRRAD
LAGLPAGLGV IDEGDRLGDM AATASVMAGL DLVVAVDTAV AHLAGALGLP CALLLLATPD
WRWGQKASRT VWYPSLRLFR QPHPGDWDGA FDAVRRAFAQ GWPLS
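
The sequences above are laid out in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal Python sketch of that cleanup plus a GC-fraction calculation. The function names are hypothetical, and the sample input is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only, copied from the record above.
first_line = "ATGAATCCCC GTCTCGCCCT TGCCTTGAAG CTCCTTGAGG CCGGTCTGGT CGAAAGCGCG"
seq = clean_sequence(first_line)
print(len(seq), seq.startswith("ATG"))
print(f"GC fraction of this line: {gc_fraction(seq):.2f}")
```

Running the same two functions over all lines of the gene sequence would give a value that can be compared with the GC content field of the record.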