Gene YpsIP31758_3429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3429 
Symbol 
ID5385842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3862044 
End bp3864392 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content52% 
IMG OID640866442 
ProductRhs element Vgr protein 
Protein accessionYP_001402384 
Protein GI153947514 
COG category[S] Function unknown 
COG ID[COG3501] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01646] Rhs element Vgr protein
[TIGR03361] type VI secretion system Vgr family protein 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTAA TTGAGATAGA GAATGAGGTT TTATCGGGCC AGGATGCTTT TCCTCTCTCC 
TTCAAGACGC AGGAAAAAGT GTCAGGGAAT CCATCCTACC AACTGGTATT TCAGGTAGCA
GAAGCGGACC TCGATCTGGT CAGTTTGCTG GGGGAAATCA TCAAAGTCAG GATTGAGTTA
CCTGACTCAG CCGGCTACCG GACCTTTTTT ACCTATGTGA TCGCCGGTGC TGATGAAGGG
CAACGGCAAG ACAAATTTGT TTACAGCCTG GAACTGAGTA CCTGGACGTG GTTTTTGATG
CAAAACCGCA ACTGCCGGAT ATTTCAGGAC CTGAATATTA TCGATATTAT CGAACAGGTA
TTTTCCAAAT ATAACTTTGC GGATTACCGT TTTGATATTG TCGGTAACTA TCGGTTACGC
GAGTACTGCG TACAGTTTGC AGAAACCGAT TTCGACTTTG TTAATCGTCT GATGGAAGAC
GAAGGCGTTT GGTATTACTT CGAACATAAC GAAGATAAGC ACACGCTGGT CATGACAGAT
CAGCAACAAT TCCCTGTATT GGAAGGGCAT TATGCAGAAC TGAGTTTCCT GCCTGACAGT
GAAGAGATGC GCGCCATCCG TGAGGGGATA CAGCGTATTC AACGCTCACA ACGCATCCAC
TCCAGCGAAA TTGTCTTACG TGACTTCGAT TTCCTTAATC CACGCAATAC ATTACAAACC
CATATCGAAG AAAGCCGCCA ACACCTGCAA GGCGTACCAC TGGAATGGTA TGACTACGCG
GCGGGTTACA CCGATCCCCA GCATGGTGAG AGCATCGCCC GTTTACGCCT TGAAGCTATA
CAGAGTAATG GGCAGCTCTT GTCTGGTGAA AGTAATGCCA CAGGATTAGT GCCAGGGCGC
TCTTTTGCTC TGGTACAGCA CCCTGATAAC AACCGTAATC GGGGATTCAA ACTGATCAGT
TGTGACTACA GTTTTGTCCA GGATGGGCCG GACAGTGCGA GCCAGGGACG TAATGTCGCC
TGTAAGTTTA AGGCATTGAA TGATGACGTC GTTTATCGTC CACAATGTGT CACGCCACCG
CCAAAGGTCC CTGGTGTGCA AAGTGCCACA GTGGTCGGTG CGCGTGAATC AGAAGTGCAT
ACCGATAAGT TCGCCCGTAT TCGCGTTCAC TTTCACTGGG ATCGTTATAA AACCACCGAA
GATGATAGCT CCTGCTGGAT CCGCGTTGTA CAGGCGTGGG CCGGTAAAGG CTGGGGCGTC
CTGGCGATGC CTCGGGTCGG GCAAGAAGTG CTGGTCAATT ACGTTGACGG CGATCTTGAT
CGCCCGATGG TGACCGGGAT CGTCTACAAC GGTGAGAACC CACCGCCTTA CCGTTTACCT
GACCACATTA ACTACTCCGG TTTTGTCTCA CGCTCACTGC GCTTTGGTCA GCCACAACAC
GCCAGCCAGC TTACCTTCGA TGATAATCGG GGCAATGAGC GGATCATGCT ACATGCTGAG
CGCGACTTAC AAAGAACGGT TGAACGTAAC AGTGCGACGG CCGTCGGTCA GGATAAATAC
GACACGGTGG AACGGACGGC CACGGAGTGG ATCAACAACC ATATCTCCTA CAAAGACTTC
AGTTTCTCGG TGACTGGCAT GAGTGTCTCC GCGACGGGCA TCAGTGTCTC GACAACGGGA
ACGAGCTTAT CTGTCACCGG CATGAGCACC AGCGTTACGG GTGTCAGTGT CGGTTTCACC
TTGATAGGGA CCTCTTTTAC TGGCGTGAGC GCGTCGTTTA CTGGTGTCAG TACCTCTTTC
ACCGGGGCCA GCAACTCGCT AACCGGTGTC AGCAACTCGA TGACCGGGTG TAGTTCCTCC
TTTACCGGTA CTAGCAATAG CATGACAGGC AGTAGCCATA GCATGACCGG CATGAGCACC
AGCATCACCG GGCATAGCAT GAGTCAGACG GGTTCCAGTA GCAGCATCAC CGGTGACAGT
ACCTCCTTTA CCGGCAGCAG CGTCAGCAGT ACGGGCAGCA GCGTCAGTAC GACCGGTGTT
AGCACCAGCA CTACGGGGAG TAGCACCTCG ACTACCGGTT GTAGCGTCAG TACTACGGGT
AGCAGCACCT CGACTACCGG TAATTCAGTC AGCATGACCG GTAACAGCAC CAGTACCACG
GGATGCAGTA TTTCCACGAC GGGCAGCAGT ATTGGGACGG TAGGAAGCAG TATCAGCACC
ACGGGCAGTA GCGTCAGTAC CACCGGTAGC AGTATCAGCA CCACGGGATT ATCCGTCAGT
TATACCGGCG CTCAATATTC CGATGTGGGT GTCGATCTGA AAACCGTTGG CATGCAAAGC
AAAAACTGA
 
Protein sequence
MQLIEIENEV LSGQDAFPLS FKTQEKVSGN PSYQLVFQVA EADLDLVSLL GEIIKVRIEL 
PDSAGYRTFF TYVIAGADEG QRQDKFVYSL ELSTWTWFLM QNRNCRIFQD LNIIDIIEQV
FSKYNFADYR FDIVGNYRLR EYCVQFAETD FDFVNRLMED EGVWYYFEHN EDKHTLVMTD
QQQFPVLEGH YAELSFLPDS EEMRAIREGI QRIQRSQRIH SSEIVLRDFD FLNPRNTLQT
HIEESRQHLQ GVPLEWYDYA AGYTDPQHGE SIARLRLEAI QSNGQLLSGE SNATGLVPGR
SFALVQHPDN NRNRGFKLIS CDYSFVQDGP DSASQGRNVA CKFKALNDDV VYRPQCVTPP
PKVPGVQSAT VVGARESEVH TDKFARIRVH FHWDRYKTTE DDSSCWIRVV QAWAGKGWGV
LAMPRVGQEV LVNYVDGDLD RPMVTGIVYN GENPPPYRLP DHINYSGFVS RSLRFGQPQH
ASQLTFDDNR GNERIMLHAE RDLQRTVERN SATAVGQDKY DTVERTATEW INNHISYKDF
SFSVTGMSVS ATGISVSTTG TSLSVTGMST SVTGVSVGFT LIGTSFTGVS ASFTGVSTSF
TGASNSLTGV SNSMTGCSSS FTGTSNSMTG SSHSMTGMST SITGHSMSQT GSSSSITGDS
TSFTGSSVSS TGSSVSTTGV STSTTGSSTS TTGCSVSTTG SSTSTTGNSV SMTGNSTSTT
GCSISTTGSS IGTVGSSIST TGSSVSTTGS SISTTGLSVS YTGAQYSDVG VDLKTVGMQS
KN