Gene YpAngola_A4051 details

Gene Information

Locus tag: YpAngola_A4051
Symbol:
ID: 5802530
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4309329
End bp: 4310588
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 49%
IMG OID: 641341835
Product: phage integrase family site specific recombinase
Protein accession: YP_001608341
Protein GI: 162419408
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
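
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4309329
END_BP = 4310588
GENE_LENGTH_BP = 1260
PROTEIN_LENGTH_AA = 419


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming a complete CDS with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # compare with Gene Length
    print(coding_codons(GENE_LENGTH_BP))   # compare with Protein Length
```

Under these assumptions both values agree with the record: the span is 1260 bp, and 1260 / 3 = 420 codons, one of which is the stop codon, leaving 419 amino acids.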


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.102635
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGCTCTGA GTGATGTGAA GGTTCGTACA GCCAAGCCTG AAGCAAAAGC CTATAAACTT 
ACCGACGGCG AAGGCATGGT GTTACTGGTT CACCCTAACG GCTCGAAATA CTGGCGGCTA
CGCTATCGCT TTGACGGTAA AGAAAAGATG TTGGCGTTGG GGAAGTACCC TGAAATATCA
CTGGCTGATG CCAGGGCACG GCGTGACGAA GCTCGTAAGC AGTTAGCTAA TGGTGTGGAC
CCAAGTGAGA GCAAGAAAGC CGTTAAGGTA GAGCAGGAGC AAGAAGCGAT AACTTTTGAA
GTGGTAGCCA GAGAGTGGCA TGCCAGTAAT CGTCAATGGT CAGAAGCTCA CAGTGCTCGA
GTACTCAAAA GCTTAGAGGA CAATCTCTTT CAAGCCATTG GCAAACGGAA TATCGCAGAC
CTCGGAACCC GTGATCTTTT ACCTCCAATT AAAGCCGTAG AGATGGCTGG ACGTCTTCAG
GTGGCTTCCC GCCTGCAACA ACGAACCACA GCAATAATGC GCTATGCAGT TCAAAGCGGT
TTAATTGATT ACAACCCCGC GCAGGAGATG GCGGGCGCTG TTTCTACCGG TAAAAGAAAG
CACCGCGCTG CACTCGAGTT AAACCGTGTT TCAGAGTTAC TTCATCGCAT CGACTACTAC
AGTGGCAGGC CACTCACTCG ACTAGCGGTA GAATTGACTT TATTGGTCTT TATCCGTTCC
AGTGAATTAC GCTTCGCCCG TTGGTCAGAA GTCGATTTTG AAACTGCAAT GTGGACGATT
CCGGGAGAAC GTGAACCACT GGAAGGTGTT AAGCATTCAC ATCGAGGTTC AAAAATGCGC
ACTCCCCATC TTATCCCCTT ATCCCGTCAA GCTCTCGCCA TTCTGGAAAA GATCAAAAGC
ATGAGTGGAA ACCGTGAGCT GATCTTTATC GGTGATCACG ACCCACGTAA ACCGATGAGT
GAAAACACGG TAAACAAAGC CCTACGCGTG ATGGGATATG ACACGAAAGT TGAAGTCTGT
GGCCACGGTT TCAGAACCAT GGCTTGTAGT TCATTGATTG AGTCTGGATT GTGGTCGAGG
GATGCAGTAG AACGGCAGAT GAGTCATCAG GAGCGCAACT CTGTGCGTGC GGCTTATATC
CATAAAGCCG AGCACTTAGA TGAGCGCAGA TTGATGATTC AGTGGTGGGC TGATTACTTG
GATGCGAATC GGGAGAAGGG GGTTAGTCCG TTTGATTTTG CGAAGTTGAA TACGAGCTGA
 
Protein sequence
MALSDVKVRT AKPEAKAYKL TDGEGMVLLV HPNGSKYWRL RYRFDGKEKM LALGKYPEIS 
LADARARRDE ARKQLANGVD PSESKKAVKV EQEQEAITFE VVAREWHASN RQWSEAHSAR
VLKSLEDNLF QAIGKRNIAD LGTRDLLPPI KAVEMAGRLQ VASRLQQRTT AIMRYAVQSG
LIDYNPAQEM AGAVSTGKRK HRAALELNRV SELLHRIDYY SGRPLTRLAV ELTLLVFIRS
SELRFARWSE VDFETAMWTI PGEREPLEGV KHSHRGSKMR TPHLIPLSRQ ALAILEKIKS
MSGNRELIFI GDHDPRKPMS ENTVNKALRV MGYDTKVEVC GHGFRTMACS SLIESGLWSR
DAVERQMSHQ ERNSVRAAYI HKAEHLDERR LMIQWWADYL DANREKGVSP FDFAKLNTS
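
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch, not the database's own pipeline: it assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGGCTCTGA GTGATGTGAA GGTTCGTACA"))
```

Applied to the first 30 bases of the gene sequence, `translate` yields `MALSDVKVRT`, the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.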