Gene YpAngola_A3853 details


Gene Information

Locus tag: YpAngola_A3853
Symbol:
ID: 5802331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4090283
End bp: 4091497
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 49%
IMG OID: 641341648
Product: phage integrase family site specific recombinase
Protein accession: YP_001608158
Protein GI: 162418465
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
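
The reported lengths can be cross-checked against the coordinates. The sketch below is only a consistency check, not the database's own calculation. It assumes both coordinates are inclusive and that the final codon is a stop codon that is not counted as a residue. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 4090283
end_bp = 4091497
reported_gene_length = 1215     # bp
reported_protein_length = 404   # aa

# Assumption: both endpoints are inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
print(gene_length == reported_gene_length,
      protein_length == reported_protein_length)
```

Under these assumptions the coordinates, the gene length, and the protein length agree with one another.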


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.0234676
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTAA ACGCCCGGCA GGTTGAAACA GCCAAACCCA AAGACAAAGC CTATAAGCTC 
GCGGATGGGG GCGGTTTGTA CTTATTGGTC AATACCAATG GCTCGCGTTA TTGGCGTTTA
AAGTACCGCT TTGCGGGGAA AGAGAAGTTG TTGGCCCTGG GGGTGTATCC AGACGTGTCG
TTAGCGGTTG CCAGAGTAAA ACGTGACGAG GCTAAGAAGA TTGTCGCTGG TGGCGGTGAT
CCCAGCCAGA ACAAACAACA GGAAAAACTA GCCCGACAAG GTGAAGCGAC GAATACCTTT
GAGGCTATCA CCCGAGAGTG GTATCAACGG CGTTACGATA GGTGGTCTGA ATCTTATCGT
TTAGAAATGA TGAGCACGTT TGAAAGTGAC GTTTTTCCAT ATATCGGTTA CCGGCCAATC
AAAGAAATTA AACCGCTTGA GCTGATGGCC GTGCTGTCCA AGTTAGAAAA ACGGGGTGCC
ACTGAGAAAA TGCGTAAGGT TCGGCAGCGA TGTGGTGAGG TTTGGAAATA CGCCATCATC
ACAGGCCGAG CCGAATATAA TCCCGCCCCT GACCTTGCCA GTGCTTTTGC TCCTCATAAA
CGCGAACACT ATCCACATCT TACGATTAAT GAAATCCCTG AATTTCTTTC CAGCCTTGCC
AGTTATAGCG GCAGCATGTT GGTTAAGTTG GCTATGCGGT TGTTGATCCT CACCGGTACC
CGCCCAGGTG AATTGCGGCA GGCCGAATGG GCAGAGTTTG ATTTTGATAA TGCCTTATGG
GAGATCCCCG CGGTGCGAAT GAAGATGCGT CGGCCACATA TGGTGCCGCT TCCGGATCAA
GCCATTACGA TACTAAAGCA GATTCAACCT ATCAGCGGGC GCTATCAGTT TGTGTTCCCT
GGCAGAATAC AGCACAGCAA GCCTATCAGT GAGATGACAC TCAATGTACT AATCCGACGT
ATCGGTTATG GCGGCAGAGC CACTGGACAT GGTTTTCGCC ACACAATGAG CACTATTTTG
CATGAGCAAG GTTTCAATAC GGCCTGGATA GAAACACAAC TGGCTCATGT CGATAAAAAC
AGTATTCGCG GTACTTATAA TCACGCTCAA TATCTTGATG GACGCCGTGA AATGCTTCAA
TGGTATGCCG ACTATATGGA TAGTCTGGAA AATGGCGAGA ATGTGATTCA TGGCAGATTT
GGGAAACGGG CATAG
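
GC content is the fraction of bases that are G or C. The sketch below shows one way to compute it from a sequence written in the space-separated layout used here. The function name `gc_fraction` is hypothetical. Only the first row of the gene sequence is used as input, so the result is not expected to equal the 49% reported for the whole gene.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First row (60 bases) of the gene sequence above.
first_row = "ATGAAGCTAA ACGCCCGGCA GGTTGAAACA GCCAAACCCA AAGACAAAGC CTATAAGCTC"

print(f"{gc_fraction(first_row):.1%}")
```

Passing the complete gene sequence as a single string to the same function would give the gene-wide value.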
 
Protein sequence
MKLNARQVET AKPKDKAYKL ADGGGLYLLV NTNGSRYWRL KYRFAGKEKL LALGVYPDVS 
LAVARVKRDE AKKIVAGGGD PSQNKQQEKL ARQGEATNTF EAITREWYQR RYDRWSESYR
LEMMSTFESD VFPYIGYRPI KEIKPLELMA VLSKLEKRGA TEKMRKVRQR CGEVWKYAII
TGRAEYNPAP DLASAFAPHK REHYPHLTIN EIPEFLSSLA SYSGSMLVKL AMRLLILTGT
RPGELRQAEW AEFDFDNALW EIPAVRMKMR RPHMVPLPDQ AITILKQIQP ISGRYQFVFP
GRIQHSKPIS EMTLNVLIRR IGYGGRATGH GFRHTMSTIL HEQGFNTAWI ETQLAHVDKN
SIRGTYNHAQ YLDGRREMLQ WYADYMDSLE NGENVIHGRF GKRA
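
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a minimal illustration, not the database's pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in the start codons they allow). The names `CODON_TABLE` and `translate` are hypothetical. Only the first 30 and last 15 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spacing used in the record
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


gene_start = "ATGAAGCTAA ACGCCCGGCA GGTTGAAACA"  # first 30 bases of the gene
gene_end = "GGGAAACGGG CATAG"                    # last 15 bases of the gene

print(translate(gene_start))  # MKLNARQVET
print(translate(gene_end))    # GKRA
```

Both outputs match the corresponding ends of the protein sequence listed above, and the final TAG codon is the stop codon that is not counted in the 404 aa length.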