Gene YpsIP31758_2737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2737 
Symbol 
ID5387571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3090502 
End bp3091908 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content55% 
IMG OID640865728 
Productputative DNA circulation protein 
Protein accessionYP_001401702 
Protein GI153948193 
COG category[R] General function prediction only 
COG ID[COG4228] Mu-like prophage DNA circulation protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTCA TTGGCAACAC ATTATCAGCG TTATTGGGAG GCAGTGACGA CAGCTGGCAA 
TGGTCGGAAC ACCTTCATCG AGCCTCCTTT CGTGGCGTTC CCTTTGTGGT CGTCAGTGGG
CAAGGTACCT TTGGTCGCCG CCAGGTAACA CACAGCTACC CCTATCGCGA TACCAGCTAT
ATCGAAGATT TGGGCCGCAA TACGCGCAAA ATTGTTCTGA AAGGGATTTT GATACAAAAC
AGCCAGATCT ATACCGCACC TGATGTGATG ACTCAACGTG ACTCATTGAT TGCGGCTTGT
GAAATGTCGG GGCCGGGCAC TCTGGTCCAC CCGACACTGG GGGAAATGAC GGTCAGCATC
TCCGAGGCAG GGCTATTGAT CGATGATAGC TTCAGCAGTG AGCGGGTCTT TTCCTTTACC
TTAACCGCCA TCGAGTCTGG CCTGCGTGCC TTTGCTATTA CTGGCTCCGC AGAAATGGGC
GCATCCATTC AATCCTCCTG GCTAGGGCTA AGTGCTAAAG CGGTTGCGGG TTTTATCTCA
ACGGTGAAAG GCGAAATGCG CTCAGCGACT CAGGCGATAA AAACCCTTAA AAATACCGCT
GCATTCTGGC GTCGGATGGT GACGGGCACG GCCAACGAAG CCAGTAATTT GGGCAACGCC
CTACGCTCAA CCTTTGGTCG CAACCGCTAT GGCCGCTATA ACCACGGCAC TGTCGGAGGC
AGCAGCACGG GAGCGACAAC GACGGTTAGC CAACAAAATG ACACGGCGGA TTTATCCTCG
CTGGTGGCGC AACGGATGGC ACTGGTGGTT GAAGGACGGG CGGCGCTCGA CGCGGCGTTG
GACGAGTTAC TCGCCGCCAA CAGTATTGAA AGCCATGCTG ACAGTGTGCT GGCCGTGGTC
AATGCCCTGC TGGCGACGGG CATCAGTACG CGGGATATTA TCCGTATCAT GGAAACCCTG
ACGCTAGCCC ATGACGATAC TTTCCGTGCC AACGACAGTG ATAGGGCCGT CGCGGATGCC
AGCCACCACT TAATGGCCAC ATTATGCACT GGGGCGATGA TCCAAGTGGC AGCGCAATAT
CAACCGGAAA GCTATGACGA TGCGGTTGCG GTATTGGGCC GGGTTTGCCT GGTGATTGAC
AATACTGCAC TGGTCGCCGC CGACAGGGGG AATGATGAGA CCTATCGTGC GCTGGTGCAG
ATGCGTGAAT CTATCGTGAC CGTGCTACAG CAGGCGGGGG CCAATCTATC ACGGGTTGGC
GAGGTCAGTT TTAACCGTTC ACTACCGGCT TTGATGCTGG CAAACCGCCT CTATCAGGAT
GCGTTACGCG GCGATGCGCT GGTGAAAATG GCTAATCCTA TTCACCCGGC ATTTATGCCC
ATCCGATTTA AGGCGCTGAA TCTATGA
 
Protein sequence
MSLIGNTLSA LLGGSDDSWQ WSEHLHRASF RGVPFVVVSG QGTFGRRQVT HSYPYRDTSY 
IEDLGRNTRK IVLKGILIQN SQIYTAPDVM TQRDSLIAAC EMSGPGTLVH PTLGEMTVSI
SEAGLLIDDS FSSERVFSFT LTAIESGLRA FAITGSAEMG ASIQSSWLGL SAKAVAGFIS
TVKGEMRSAT QAIKTLKNTA AFWRRMVTGT ANEASNLGNA LRSTFGRNRY GRYNHGTVGG
SSTGATTTVS QQNDTADLSS LVAQRMALVV EGRAALDAAL DELLAANSIE SHADSVLAVV
NALLATGIST RDIIRIMETL TLAHDDTFRA NDSDRAVADA SHHLMATLCT GAMIQVAAQY
QPESYDDAVA VLGRVCLVID NTALVAADRG NDETYRALVQ MRESIVTVLQ QAGANLSRVG
EVSFNRSLPA LMLANRLYQD ALRGDALVKM ANPIHPAFMP IRFKALNL