Gene Rru_A2001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2001 
Symbol 
ID3835426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2310554 
End bp2311606 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content64% 
IMG OID637826101 
Producthypothetical protein 
Protein accessionYP_427088 
Protein GI83593336 
COG category[S] Function unknown 
COG ID[COG3528] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGCC CGATGTTTTC CCTTCTAGGG GCCGCCGCAC TGGGCTGCGC CGTGGTCGCC 
GCGCCCGCCA GCGCCGCCGA TCCGACCGAA CAGATCGATC CCGATACCGG TACGCCGTCC
TATTATCCCA AGTGGGATGA CGGCACGCTG TCGATTCAGG TTGAAAACGA CAAGTTCGGC
TTTTCCGGGA CCGACCAGCA TTACACCAAC GGCCTTCATG CCACCTGGTT GTCGGGGACC
GGCGATATGC CGATCTGGGC CCAGGAAGTC GGCAACGCCT TGCCGTTCTT TCCGACCAAT
TCGATCAAGC GCTACAGCCT GAGCTTTGGT CAGAGCATTT TCACGCCGTC CGACACCCAG
GCCGATGATC CCGATCCCGA TGATCGCCCC TATGCGGGCT GGACCTATAT CGGCCTGGGC
TATCTGGCCG AAACCGGCAA TACCCTTGAC CGCCTGGAAA TCGACTTGGG CGTGGTCGGT
CCCTGGGCCC TGGGCGAAGA GACCCAGAAC AACTTCCATA GCCTGATCGG CGTCGATACG
GCCAAGGGCT GGGGCTCGCA ATTGCATAAC GAGCCGGGCG CCGTGCTCTA TTACGAACGC
ATGTGGCGGG CCTTGGGCAG CTTCAAGGCC GGTGGCCTGG GCTTCGACTT CTCGCCCCAT
GCCGGCGCCG CCCTGGGCAA CGTTTATACC TATGCCGCCG GTGGCGGCAC CGTGCGGGTC
GGCTTCAACC TGCCCGATGA TTACGGCCCG CCGCGCATCC GCCCCAGCCT TCCCGGCTCG
ACCCAGTTCG AACCGACCGG CGGTCTGGGC GGCTATCTGT TCGCCGGCGT CGAAGGCCGC
GCCGTCGCCC GCAACATCTT CCTCGATGGC AACACCTTCC GCGACAGCCC CAGCGTCGAC
AAGAAGATCT TCGTGGGCGA CGTTCAGGCC GGCGTGGCGG TGACCCTTGG CAATACCCGG
GTGACCTATA CCCAGGCCAT CCGCTCGCCC GAATTCGACG GTCAGGACAA GCCCGATATC
TTCGGATCGA TCAGCCTGTC CTATCGCTTC TAG
 
Protein sequence
MMRPMFSLLG AAALGCAVVA APASAADPTE QIDPDTGTPS YYPKWDDGTL SIQVENDKFG 
FSGTDQHYTN GLHATWLSGT GDMPIWAQEV GNALPFFPTN SIKRYSLSFG QSIFTPSDTQ
ADDPDPDDRP YAGWTYIGLG YLAETGNTLD RLEIDLGVVG PWALGEETQN NFHSLIGVDT
AKGWGSQLHN EPGAVLYYER MWRALGSFKA GGLGFDFSPH AGAALGNVYT YAAGGGTVRV
GFNLPDDYGP PRIRPSLPGS TQFEPTGGLG GYLFAGVEGR AVARNIFLDG NTFRDSPSVD
KKIFVGDVQA GVAVTLGNTR VTYTQAIRSP EFDGQDKPDI FGSISLSYRF