Gene YpsIP31758_4037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_4037 
Symbol 
ID5386169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp4547415 
End bp4548743 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content43% 
IMG OID640867067 
Productmetalloprotease 
Protein accessionYP_001402983 
Protein GI153948040 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones62 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTACT CCAGGGAACA CCGCAACACT ATAATTAAAA ATGAACATGT CATGCGCAGA 
GGCATTCATT ATAAGAATGA AATTAAAGGT GTTATAGCAC CACAAATATC TAGCCATCAG
TCATGGAAAG AAAACACTAT TCATAATAAA AATACAAACC TGACATATTC ATTTAGTCGA
GCATATACAT TATGGGATTA TGATCGAACG TTCCAACAAA ACGCTTATGT CTCATTATTT
AATCCAGCCC AAATCCATCA GGCAAAAATC GCGATGCAAT CTTGGGCTGA TGTAGCCAAC
ATCTCCTTCA CCGAAGCATC AGCAGACTCT TCCGCCAATA TTCTATTTTT AAATTTTCAG
CGCCCAGGCA ATGTGGCAGG TTATGCCTAT CATCCTAATC TAGGGAGTTT CAGCCCAATA
TGGATTAATT ACAGCTTCAG CGATAACCAA CATCCCAGCA GATTAAATTA TGGTGGCGGG
GTATTAACAC ATGAGATTGG CCATGCTCTG GGGTTGGGTC ATTCTCATGC CCCCCATGGC
TACACGCAAC AAATGAGTGT GATGAGCTAT TTATCCGAAC AGGATTCAGG CGCGAACTAT
GGCCAACATT ACTTATCCAC GCCACAAATG TACGATATCG CCGCAATCCA GTATCTGTAT
GGGGCTAATC TACACACCCG CACCGGTGAT ACCGTTTATG GCTTCAACTC GACGAGTTAT
AGAGATCATT TCACCGCCAC CCACGCCAGT GATGCGTTGA TTTTCTGTGT CTGGGATGCT
GGCGGCAATG ATACTTTTGA CTTCTCTGGC TATAAGCAAA ATCAAATGAT TAATCTTAAC
GAACTCTGTT TTTCTGATGT TGGTGGACTA AAAGGAAATG TGTCTATTGC AGCGGATGTT
ACGATTGAAA ATGCCATCGG CGGCAGTGGC CATGATGATA TTATCGGCAA TCACACCAAT
AATATTTTGA CCGGTAACGG CGGATCTGAT CAACTTTGGG GTAACGGGGG CAATAATACT
TTCCGCTATG CCAGTGCCAG AGATTCAATG ACCACCTCGC CCGATACTAT TCATGATTTT
AAATCAGGCC GTGACAAGAT AGATTTGTCG CAATTAATGC CCTCAACCGA CCGTGTTATT
TTTGTCGATA GATTAAGTTT TAACGGTCAA ACAGAGATGG GGCAGCAATA TAATGAAGTG
GCGGACATAA CTTATCTTAT GATCGATTTT GACGCTCAAG TCAGCGAGTG CGATATGATG
ATTAAATTTA CCGGCAGGCA CCATTTCACC GCCAATGACT TTATTTTAAG TACGTCACTG
ACGGCATAA
 
Protein sequence
MSYSREHRNT IIKNEHVMRR GIHYKNEIKG VIAPQISSHQ SWKENTIHNK NTNLTYSFSR 
AYTLWDYDRT FQQNAYVSLF NPAQIHQAKI AMQSWADVAN ISFTEASADS SANILFLNFQ
RPGNVAGYAY HPNLGSFSPI WINYSFSDNQ HPSRLNYGGG VLTHEIGHAL GLGHSHAPHG
YTQQMSVMSY LSEQDSGANY GQHYLSTPQM YDIAAIQYLY GANLHTRTGD TVYGFNSTSY
RDHFTATHAS DALIFCVWDA GGNDTFDFSG YKQNQMINLN ELCFSDVGGL KGNVSIAADV
TIENAIGGSG HDDIIGNHTN NILTGNGGSD QLWGNGGNNT FRYASARDSM TTSPDTIHDF
KSGRDKIDLS QLMPSTDRVI FVDRLSFNGQ TEMGQQYNEV ADITYLMIDF DAQVSECDMM
IKFTGRHHFT ANDFILSTSL TA