Gene Rru_A1154 details

Gene Information

Locus tag: Rru_A1154
Symbol:
ID: 3834664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 1365051
End bp: 1366166
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 68%
IMG OID: 637825243
Product: hypothetical protein
Protein accession: YP_426242
Protein GI: 83592490
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
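
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; neither assumption is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1365051, 1366166)
print(length)                     # 1116
print(protein_length_aa(length))  # 371
```

Both values agree with the Gene Length and Protein Length fields listed above.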


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00923322
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTTGGAT CTGGCGGCAC AGGTGCCCCC CCGCCTCGGT CAAGCCCCAT GACCACCCTT 
TACGTCACCC AGCCCGGCTC GGTCGTGCGC TCCGAAGGGG GATCGCTGAC GGTTTGGGTG
GAGACCGAGG CCGACGATCC CGGCCCCAAT GACTCGCCCG TGCGCCGCAA ACGTCTGGCC
TCGGTCGAAC CCCACCGGCT GGAAAGCCTT GTTCTTCTTG GCTTCACCAC CATCACCGCC
AATGCCATGC GCCTGTGCAT GGCCAATAAG ATCGCCGTCT CGCTTCTTGA CGGCGGCGGG
GGATTGGCCG CCCGCGTCGT GCCACCCGAG GCCCGCTCGG CCGACCTGCG CCTGCACCAA
TACGCCCTTC ACTTGGACCC GCCCGAGCGG CTGATCCGCG CCCGCGCCGT CGTCACCGCC
AAATTGCGCA ATGCGGCGGC GGTTCTGCGC GGCATCCGCA GCAATCAGGC CTCCAGCGCC
GCTTTGGCCA GCGCGATCAC CCAAACCGAG GCCAGCGCCG AGGCGGCGGC GGCGGCCGTT
TCGGCGGAAA GCCTGCTGGG AATCGAAGGC AATGGCGCCC ATCAATATTT CGCCGGTCTG
CGCACGGCCT TCGTCGGTGG CATTCCCTTT CTTGGACGGG CCCAACGCCC ACCCCCCGAC
CCGGCCAATT CCCTGCTGTC CTTTGGCTAT GTCTTGCTGG GCAATCGGCT GACCGGCCTG
CTGGAAGCCC GGGGTGTCGA TCCCTGCCTG GGCTTCTTTC ACGATCTGCG ACCGGGACGG
CCGTCGTTAG CCCTGGATCT GCTGGAAGAA CTGCGCCACC CGGTGGTTGA TCGCCTGGCC
CTGCGGATCT GCAATCTGCG CAAGATCCAG CCCCAGCATT TCGAACCCGA CGCCGAGCGC
CCGGGCGGAG TCAAACTCAC GGTCGACGGC CGCAAGATCT TTCTGGAGGA ATGGGAAGGC
CACCTTGCCC GCCCCTTGCG CGAACCGGGC GTGGCCGCCG AGCACCGCCT TGACGTGCAC
CGCCTGCTTC AGCGTCAGGT CGACCGTCTG GTCAGCGACC TGCGCGGCGG CGAACCCTAT
CGCCCGTTCC GCTTTGGCAC CAGCCGCCCG GGCTGA
 
Protein sequence
MVGSGGTGAP PPRSSPMTTL YVTQPGSVVR SEGGSLTVWV ETEADDPGPN DSPVRRKRLA 
SVEPHRLESL VLLGFTTITA NAMRLCMANK IAVSLLDGGG GLAARVVPPE ARSADLRLHQ
YALHLDPPER LIRARAVVTA KLRNAAAVLR GIRSNQASSA ALASAITQTE ASAEAAAAAV
SAESLLGIEG NGAHQYFAGL RTAFVGGIPF LGRAQRPPPD PANSLLSFGY VLLGNRLTGL
LEARGVDPCL GFFHDLRPGR PSLALDLLEE LRHPVVDRLA LRICNLRKIQ PQHFEPDAER
PGGVKLTVDG RKIFLEEWEG HLARPLREPG VAAEHRLDVH RLLQRQVDRL VSDLRGGEPY
RPFRFGTSRP G
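
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of the permitted initiation codons and an initiation codon is translated as methionine. The following is a minimal sketch of such a translation, not the method used to produce this record. The function and constant names are hypothetical, the input is assumed to be the coding strand in frame, and only unambiguous bases (A, C, G, T) are handled.

```python
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Initiation codons permitted by translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    A recognised initiation codon in the first position is emitted as 'M'.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]  # raises KeyError on ambiguous bases
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS_TABLE_11:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGGTTGGAT CTGGCGGCAC AGGTGCCCCC"))  # MVGSGGTGAP
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, provided the assumptions above hold.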