Gene YpsIP31758_3540 details

Gene Information

Locus tag: YpsIP31758_3540
Symbol:
ID: 5388453
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3999144
End bp: 4000379
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 49%
IMG OID: 640866555
Product: phage integrase family site specific recombinase
Protein accession: YP_001402494
Protein GI: 153950646
COG category: [L] Replication, recombination and repair
COG ID: [COG0582] Integrase
TIGRFAM ID:
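
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the CDS ends in a stop codon. The function names are hypothetical and only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(3999144, 4000379)
print(length, protein_length_aa(length))  # 1236 411
```

Both values match the Gene Length and Protein Length fields of this record.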


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0000292707
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTTTGA GTGATGTGAA GGTTCGTTCG GCCAAGCCTG AAGAAAAAGC CTATAAGCTT 
ACCGATGGTG ACGGTATGGT GTTACTGGTT CATCCTAACG GCTCAAAGTA CTGGCGGCTG
CGATATCGCT TTGGCGGCAA AGAAAAGATG TTGGCGCTGG GAATATATCC TGAAATATCG
TTGGCAGATG CCAGAGCACG GCGTGACGAA GCTCGTAAGC AGTTAGCTAA CGGTGTGGAC
CCAAGTGAGA GCAAGAAAGC CGTTAAGGTA GAGCAGGAGC AAGAAGCGAT AACTTTTGAA
GTGGTAGCCA GAGAGTGGCA TGCCAGTAAT CGTCAATGGT CAGAAGCTCA CAGTGCTCGA
GTGCTCAAAA GCTTAGAGGA CAATCTCTTT CAATCCATAG GCAAACGGAA TATCACAGAC
CTCGGAACCC GTGATCTTTT ACCTCCCATT AAGGCCGTAG AGATGTCTGG GCGTCTTGAG
GTGGCTTCCC GCCTGCAACA ACGAACCACA GCGATAATGC GCTATGCCGT TCAAAGCGGT
TTAATTGATT ACAACCCCGC GCAGGAGATG GCGGGCGCTG TTGCTACCGG TAAAAGAAAG
CACCGTGCTG CACTTGAGTT AAACCGTGTT TCAGAGTTAC TTCACCGTAT TGACTACTAC
AGTGGCAGGC CACTCACTCG GCTAGCGGTA GAATTGACTT TATTGGTCTT TATCCGTTCC
AGTGAATTAC GCTTTGCCCG TTGGTCAGAA GTAGATTTTG AAACCGCCAT GTGGACCATC
CCTGGAGAGC GTGAACCACT GGAAGGTGTT AAACACTCGC ATCGGGGCTC AAAAATGCGC
ACTCCCCATC TTGTCCCCTT ATCCCGCCAG GCGCTCGCCA TTCTGGAAAA GATCAAAAGC
ATGAGTGGAA ATCGTGAGCT GATTTTTATC GGTGATCACG ACCCACGTAA GCCCATGAGT
GAGAACACGG TGAACAAAGC CCTACGTGTT ATGGGCTACG ATACTAAAGT AGATGTCTGT
GGGCACGGTT TTAGAACTAT GGCCTGTAGT TCATTGATTG AGTCGGGATT GTGGTCTAGG
GATGCAGTAG AAAGGCAGAT GAGTCACCAG GAGCGCAACT CTGTACGTGC GGCTTATATT
CATAAAGCCG AGCACTTAGA TGAGCGTAAA CTGATGATTC AGTGGTGGGC GGATTTTTTT
GGAGGCAAAC AGGCAGCAAG CTATTTCACC TTTTGA
 
Protein sequence
MALSDVKVRS AKPEEKAYKL TDGDGMVLLV HPNGSKYWRL RYRFGGKEKM LALGIYPEIS 
LADARARRDE ARKQLANGVD PSESKKAVKV EQEQEAITFE VVAREWHASN RQWSEAHSAR
VLKSLEDNLF QSIGKRNITD LGTRDLLPPI KAVEMSGRLE VASRLQQRTT AIMRYAVQSG
LIDYNPAQEM AGAVATGKRK HRAALELNRV SELLHRIDYY SGRPLTRLAV ELTLLVFIRS
SELRFARWSE VDFETAMWTI PGEREPLEGV KHSHRGSKMR TPHLVPLSRQ ALAILEKIKS
MSGNRELIFI GDHDPRKPMS ENTVNKALRV MGYDTKVDVC GHGFRTMACS SLIESGLWSR
DAVERQMSHQ ERNSVRAAYI HKAEHLDERK LMIQWWADFF GGKQAASYFT F
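
The protein sequence above corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that relationship using only the Python standard library. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. Table 11 assigns amino acids to codons the same way as the standard code and differs in the permitted start codons, which this sketch does not model; that does not matter here because the gene begins with ATG. Only the first row of the gene sequence is used, to keep the example short.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the 10-character block spacing and line breaks used in the record."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGGCTTTGA GTGATGTGAA GGTTCGTTCG GCCAAGCCTG AAGAAAAAGC CTATAAGCTT"
dna = clean(first_row)
print(translate(dna))  # MALSDVKVRSAKPEEKAYKL
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.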