Gene YpAngola_A4153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4153 
Symbol 
ID5802633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4443850 
End bp4445043 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content46% 
IMG OID641341926 
Productphage integrase family site specific recombinase 
Protein accessionYP_001608430 
Protein GI162418700 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.448481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0214927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACTAT CAGATAGACA GATTCGCCGT GCTAAACCAC AAGAAAAAGC CTATACCCTG 
AGTGACGGGC AAGGGCTATC ACTCTTAATT GAACCTAACG GCAGTAAAGG GTGGCGCTTT
CGCTATCGTT TTGCCGGTAA AGCCCGATTA ATGTCATTAG GGACTTATGA TCTTGTTTCT
CTTGCTGAAG CTCGCTCTAA GCGAGATGTG GCACGTAAAC AGGTTGCAGA TGGAACAGAT
CCCGCAGAGG TGAAAAAGGC TGAGAAGTTG GCGCAGCGGC TTTCATCGGA AAAATCTTTT
GAGGCCATTA GCCGTGAGTG GCATAAAGCC AAAGCCGATC GCTGGTCATT GGGCTATAGG
GAAGAAATCA TGAGTACCTT TGAGGCGGAT ATATTCCCGT ATATTGGTAA ACGGCCAATT
GCAGAAATCA CTCCGCTGGA GTTACTTGAT GTGCTTCAAC GTATCGAAAA GCGTGGAGCG
TTAGAAAAAA CGCGCAAAGT ACGCCAGCGC TGTGGTGAAG TATTTCGTTA TGCCATTATT
ACCGGCCGTG CTGAATATAA TCCCGCCCCT GACCTTGCCA GCGCATTAAG TACACCAAAG
AAACAGCACT ACCCGTTCCT GTCTGCCGAA GAGATGCCAT ATTTTATTCG TGATCTAGAG
GGTTACACCG GCAGCATCAT AACCAAGAAC GCGGCAAAGA TACTTATGCT AACGGGTGTG
CGAACTAAGG AAATGCGTTT TGCTACTTGG CAGGAAATCG ATCTTGAGGG CGGCTTGTGG
GAGATTCCAG CAGAACGAAT GAAGATGCGC CGCCCCCATA TCGTGCCGCT GTCTACACAA
GTTATAGCGC TATTTAAACA GCTCTTACCT ATCACCGGGC ATTACCCTTA TATTTTCATT
GGACTGAATG ACCGTAAAAA GCCCATAAGT AAAGAAACAG TCAATCAGGT CATTGAATTA
CTGGGTTATA AAGGCCGTGC GACGGGTCAC GGATTTAGAC ATACCATGTC AACGATATTG
CATGAGCAGG GCTATGATAG TGCATGGATT GAGCTACAAT TGGCTCACGT TGATAAGAAC
AGTATTCGTG GTACTTACAA TCATGCGCAG TATCTTGAAA AGCGAAGAGA GATGTTGCAG
TGGTATGCTG ATCTCATTTT TAATTTTTTC AGGAAAAATT ATGAGCGCAT CTAA
 
Protein sequence
MPLSDRQIRR AKPQEKAYTL SDGQGLSLLI EPNGSKGWRF RYRFAGKARL MSLGTYDLVS 
LAEARSKRDV ARKQVADGTD PAEVKKAEKL AQRLSSEKSF EAISREWHKA KADRWSLGYR
EEIMSTFEAD IFPYIGKRPI AEITPLELLD VLQRIEKRGA LEKTRKVRQR CGEVFRYAII
TGRAEYNPAP DLASALSTPK KQHYPFLSAE EMPYFIRDLE GYTGSIITKN AAKILMLTGV
RTKEMRFATW QEIDLEGGLW EIPAERMKMR RPHIVPLSTQ VIALFKQLLP ITGHYPYIFI
GLNDRKKPIS KETVNQVIEL LGYKGRATGH GFRHTMSTIL HEQGYDSAWI ELQLAHVDKN
SIRGTYNHAQ YLEKRREMLQ WYADLIFNFF RKNYERI