Gene YpAngola_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_0033 
Symbol 
ID5798403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010158 
Strand
Start bp21872 
End bp22948 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content55% 
IMG OID641337934 
ProductRecA protein 
Protein accessionYP_001604551 
Protein GI162417927 
COG category[L] Replication, recombination and repair 
COG ID[COG0468] RecA/RadA recombinase 
TIGRFAM ID[TIGR02012] protein RecA 


Plasmid Coverage information

Num covering plasmid clones298 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones424 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG GCAAATCCGC ACTGGCACTG GCGCTGAAAA AGAAAATCGG CAGCAATGAC 
GAGATTCAGA AGGTCTCCCA CTGGATTGAC TCCGGTTTCC CTCCACTGAA CAAAGCCATT
TCCGGACGTT ACGACGGTGG TTTTCCGTGT GGGCGTATCG TTGAAGTCTT CGGGCCACCA
AGCGCCGGTA AAACCTTTTT GGCGACGGCT GCGATGGTAT CAGCACAGAA ACAGGATGGT
CTGGCCGTAT TCCTTGACCA CGAAAACAGC TTCGACGTTG GTCTTGCGGT GGCGAATGGC
TTGAACGCCG ACGAAGACGA CGGTCAGTGG GTCTACAAAC AGCCGGATAC CTTCGAAGAC
TCCGTTGAGC TGATCGGCAC AATCCTCAAG CTGGTGCGCG ACGAAGAGCT TATCCCGGAA
ACAGCCCCTA TCTGCATCGT TGCCGACTCT CTGGCGTCGA TGGTACCGAA CTCGAAAGCT
GAGAAGTTCG ACAAGATGGC AGAAGGCACT GCGAAGGACA AAGATCAGCT GAACATGAAC
GACAACACGG CGCTGGCGCG CGCGACGAGT GCGAACTTCC CTACTCTGGC GCTTTGGGCG
CGTAAGTACA ACGCGTGCAT TATCTTCTTA AACCAGGTGC GTACCAAAAT TGGCGTGATG
TTTGGCGATC CGACTACGTC TCCGGGCGGC GACTCTCCGA AGTTCTACGC GTCGGTGCGC
ATCCGTCTGG GAGCATCCGT CATGAAGGAT GGCAAAGAGA AGATCGGACA GGACGTTGGC
GCCGAGTGCA TTAAAAACAA AGTCGCGCCT CCGTTTGGTA AATGCTCATG GAAATTCTAC
TTCGACCCGA CTCGCGGGCT GGACGTCATC GAATCTCTGG TTGAGTACAT GCTGGAAGAA
GGATACCTGC CAAAGAACGC CAGCGGGCGT GTGGAAATTG GCGATAAGAG ATATACCAAA
TCGCAGATCG TCGAGATGTA CCGCGAGAAG CCACTCCCGG AAATCATCGC AGCACTCCAG
GCGATAGACG AACGGCGAGC GAAAGAGTCG TCCCCAGCAG AGACAGAAGA AGCGTAA
 
Protein sequence
MAKGKSALAL ALKKKIGSND EIQKVSHWID SGFPPLNKAI SGRYDGGFPC GRIVEVFGPP 
SAGKTFLATA AMVSAQKQDG LAVFLDHENS FDVGLAVANG LNADEDDGQW VYKQPDTFED
SVELIGTILK LVRDEELIPE TAPICIVADS LASMVPNSKA EKFDKMAEGT AKDKDQLNMN
DNTALARATS ANFPTLALWA RKYNACIIFL NQVRTKIGVM FGDPTTSPGG DSPKFYASVR
IRLGASVMKD GKEKIGQDVG AECIKNKVAP PFGKCSWKFY FDPTRGLDVI ESLVEYMLEE
GYLPKNASGR VEIGDKRYTK SQIVEMYREK PLPEIIAALQ AIDERRAKES SPAETEEA