Gene Rru_A0344 details


Gene Information

Locus tag: Rru_A0344
Symbol:
ID: 3834631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 416445
End bp: 417590
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 62%
IMG OID: 637824427
Product: CRISPR-associated Cse4 family protein
Protein accession: YP_425436
Protein GI: 83591684
COG category:
COG ID:
TIGRFAM ID: [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4
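
The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 416445
end_bp = 417590

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a stop codon that encodes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1146
print(codons)          # 382
print(protein_length)  # 381
```

Under these assumptions the computed values agree with the listed gene length of 1146 bp and protein length of 381 aa.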


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCCGT CCCGTTTTCT GCAAATCCAC AGTTTGCATT CCTATACGGC GGCGCTTTTG 
AACCGGGATG ATTCCGGTCT GGCCAAGCGG CTGACCTATG GCGGATCAAA CCGCACCCGC
ATTTCCTCGC AATGCCTGAA GCGTCACTGG CGGATGGCCG AGCACGACCC CCATGCCCTG
CAGACCCTGG GGGGATACGT TGGCTCGTTC CGCTCGCGCG AATTGGTTAC GGATCTGGTG
ATCAAGCCGC TTGAGGGGCG TTATCCCCAG GACATCCTTG ATGTCCTGGA GCCGGAGTTT
CAGAAACTGG TTTATGGGGA CAAGGCGGAC AAGGGCAAGA AAAGCCGCCA GACCCTGTTG
TTGGGACAGC CCGAACTGGC GTGGCTGGCC CGGCGGGCGG AAGAACTCGC CGCCGGGGCA
AACGATGCGA AAGCCTTGCA AAAGGCCGTC GCCGATTGGC GGAAAGACGC GAATTTCAAG
GCGATGAGCG AGAACGCGGC GCTGCCCGGC GGTCTTGTCG CCGCCTTGTT CGGCCGCATG
GTGACATCCG ATCCGGCGGC CAATATCGAC GCGCCGGTGC ATGTCGCCCA TGCCTTCACC
GTTCATGCCG AAGAGGCGGA GGGCGATTAC TTCACCGCCG TTGATGATCT GAAAAAAGAC
GAGAGCGATA GCGGCGCCGA TACGATCCAG GAAACCGAAC TAACCTCGGG CCTGTTCTAT
GGCTATGTGG TGATCGATCT GCCCGGCCTG ATCGGTAATT GCGGCGGTGA CAAGGAGATC
GCCGCCCAAG TGGTGAATAA TCTTGTCTAT CTCATCGCCG AAGTTTCCCC GGGCGCCAAG
CTGGGCTCCA CCGCGCCCTA TGGCCGCGCC GATCTGATGC TGATCGAAGC GGGCGACCGC
CAGCCCCGCA GTCTGGCGAC GGCCTATCGC AAGGCGATCG CCCCTGATCG CGAACAGGCG
GTGGCGGCTC TGGACGGCTG TTTGGCCAAG CTTGATGCCA CCTATGAGAC GGGGGAGGCC
CGGCGCTATC TGTCGCTGGC CGAAACGCCC TTGACCGGAC CGGCGACCAG CGGCTTGGAA
AAGCTGTCGC TCAAGGCCCT GGCGGACTGG ACGGCGAGCC GGGTGAAGGA GGCTCCCGAT
GCCTGA
 
Protein sequence
MTPSRFLQIH SLHSYTAALL NRDDSGLAKR LTYGGSNRTR ISSQCLKRHW RMAEHDPHAL 
QTLGGYVGSF RSRELVTDLV IKPLEGRYPQ DILDVLEPEF QKLVYGDKAD KGKKSRQTLL
LGQPELAWLA RRAEELAAGA NDAKALQKAV ADWRKDANFK AMSENAALPG GLVAALFGRM
VTSDPAANID APVHVAHAFT VHAEEAEGDY FTAVDDLKKD ESDSGADTIQ ETELTSGLFY
GYVVIDLPGL IGNCGGDKEI AAQVVNNLVY LIAEVSPGAK LGSTAPYGRA DLMLIEAGDR
QPRSLATAYR KAIAPDREQA VAALDGCLAK LDATYETGEA RRYLSLAETP LTGPATSGLE
KLSLKALADW TASRVKEAPD A
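
The gene and protein sequences above can be cross-checked by translating the nucleotide rows and comparing the result with the protein rows. The following is a minimal sketch, not the tool that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which start codons are permitted. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
# Codon table built in NCBI order (bases T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACCCCGT CCCGTTTTCT GCAAATCCAC AGTTTGCATT CCTATACGGC GGCGCTTTTG"
last_row = "GCCTGA"

print(translate(first_row))  # MTPSRFLQIHSLHSYTAALL
print(translate(last_row))   # A
```

The first row translates to the first 20 residues of the listed protein, and the last row yields the final alanine followed by the TGA stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.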