Gene YpAngola_A2959 details


Gene Information

Locus tag: YpAngola_A2959
Symbol:
ID: 5801431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3120642
End bp: 3122576
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 53%
IMG OID: 641340803
Product: hypothetical protein
Protein accession: YP_001607333
Protein GI: 162420277
COG category: [S] Function unknown
COG ID: [COG3501] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR01646] Rhs element Vgr protein
            [TIGR03361] type VI secretion system Vgr family protein
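
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 3120642
END_BP = 3122576


def gene_length(start, end):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(n_bp):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1935 644
```

Both results agree with the Gene Length and Protein Length values listed in the record.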


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.0027443
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
TTGCACCGTT CGTTACAACA GTTTGCAGAA ACCGATTTCG ACTTTGTTAA TCGTCTGATG 
GAAGACGAAG GCGTTTGGTA TTACTTCGAA CATAACGAAG ATAAGCACAC GCTGGTCATG
ACAGATCAGC AACAATTCCC TGTATTGGAA GGGCATTATG CAGAACTGAG TTTCCTGCCT
GACAGTGAAG AGATGCGCGC CATCCGTGAG GGGATACAGC GTATTCAACG CTCACAACGC
ATCCACTCCA GCGAAATTGT CTTACGTGAC TTCGATTTCC TTAATCCACG CAATACATTA
CAAACCCATA TCGAAGAAAG CCGCCAACAC CTGCAAGGCG TGCCACTGGA ATGGTATGAC
TACGCGGCGG GTTACACCGA TCCCCAGCAT GGTGAGAGCA TCGCCCGTTT ACGCCTTGAA
GCTATACAGA GTAATGGGCA GCTCTTGTCT GGTGAAAGTA ATGCCACAGG ATTAGTGCCA
GGGCGCTCTT TTGCTCTGGT ACAGCACCCT GATAACAACC GTAATCGGGG ATTCAAACTG
ATCAGTTGTG ACTACAGTTT TGTCCAGGAT GGGCCGGACA GTGCGAGCCA GGGACGTAAT
GTCGCCTGTA AGTTTAAGGC ATTGAATGAT GACGTCGTTT ATCGTCCACA ATGTGTCACG
CCACCGCCAA AGGTCCCTGG TGTGCAAAGT GCCACAGTGG TCGGTGCGCG TGAATCAGAA
GTGCATACCG ATAAGTTCGC CCGTATTCGC GTTCACTTTC ACTGGGATCG TTATAAAACC
ACCGAAGATG ATAGCTCCTG CTGGATCCGC GTTGTACAGG CGTGGGCCGG TAAAGGCTGG
GGCGTCCTGG CGATGCCTCG GGTCGGGCAA GAAGTGCTGG TCAATTACGT TGACGGCGAC
CTTGATCGCC CGATGGTGAC CGGGATCGTC TACAACGGTG AGAACCCACC GCCTTACCGT
TTACCTGACC ACATTAACTA CTCCGGTTTT GTCTCACGCT CACTGCGCTT TGGTCAGCCA
CAACACGCCA GCCAGCTTAC CTTCGATGAT AATCGGGGCA ATGAGCGGAT CATGCTACAT
GCTGAGCGCG ACTTACAAAG AACGGTTGAA CGTAACAGTG CGACGGCCGT CGGTCAGGAT
AAATACGACA CGGTGGAACG GACGGCCACG GAGTGGATCA ACAACCATAT CTCCTACAAA
GACTTCAGTT TCTCGGTGAC GGGCATGAGT GTCTCCGCGA CGGGCATCAG TGTCTCGACA
ACGGGAACGA GCTTATCTGT CACCGGCATG AGCACCAGCG TCACGGGTGT CAGTGTGGGT
TTCACCTTGA TAGGGACCTC CTTTACGGGG GTGAGCACAT CGTTTACCGG TGTCGGTACC
TCCTTCACCG GGGCCAGCAA CTCGCTAACC GGTGTCAGCA ACTCGATGAC CGGGTGTAGT
TCCTCCTTTA CCGGTACTAG CAATAGCATG ACAGGCAGTA GCCATAGCAT GACCGGCATG
AGCACCAGCA TCACCGGGCA TAGCATGAGT CAGACGGGTT CCAGTAGCAG CATCACCGGT
GACAGTACCT CCTTTACCGG CAGCAGCGTC AGCAGTACGG GCAGCAGCGT CAGTACGACC
GGTGTCAGTA CCAGCACTAC GGGGAGTAGC ACCTCGACTA CCGGTTGTAG CGTCAGTACT
ACGGGTAGCA GCACCTCGAC TACCGGTAAT TCAGTCAGCA TGACCGGTAA CAGCACCAGT
ACCACGGGAT GCAGTATTTC CACGACGGGC AGCAGTATTG GGACGGTAGG AAGCAGTATC
AGCACCACGG GCAGTAGCGT CAGTACCACC GGTAGCAGTA TCAGCACCAC GGGATTATCC
GTCAGTTATA CCGGCGCTCA ATATTCCGAT GTGGGTGTCG ATCTGAAAAC CGTTGGCATG
CAAAGCAAAA ACTGA
 
Protein sequence
MHRSLQQFAE TDFDFVNRLM EDEGVWYYFE HNEDKHTLVM TDQQQFPVLE GHYAELSFLP 
DSEEMRAIRE GIQRIQRSQR IHSSEIVLRD FDFLNPRNTL QTHIEESRQH LQGVPLEWYD
YAAGYTDPQH GESIARLRLE AIQSNGQLLS GESNATGLVP GRSFALVQHP DNNRNRGFKL
ISCDYSFVQD GPDSASQGRN VACKFKALND DVVYRPQCVT PPPKVPGVQS ATVVGARESE
VHTDKFARIR VHFHWDRYKT TEDDSSCWIR VVQAWAGKGW GVLAMPRVGQ EVLVNYVDGD
LDRPMVTGIV YNGENPPPYR LPDHINYSGF VSRSLRFGQP QHASQLTFDD NRGNERIMLH
AERDLQRTVE RNSATAVGQD KYDTVERTAT EWINNHISYK DFSFSVTGMS VSATGISVST
TGTSLSVTGM STSVTGVSVG FTLIGTSFTG VSTSFTGVGT SFTGASNSLT GVSNSMTGCS
SSFTGTSNSM TGSSHSMTGM STSITGHSMS QTGSSSSITG DSTSFTGSSV SSTGSSVSTT
GVSTSTTGSS TSTTGCSVST TGSSTSTTGN SVSMTGNSTS TTGCSISTTG SSIGTVGSSI
STTGSSVSTT GSSISTTGLS VSYTGAQYSD VGVDLKTVGM QSKN
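
The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is an accepted alternative start codon and is read as methionine at the initiation position. The sketch below illustrates this in plain Python with no external libraries. The helper names (`clean`, `gc_content`, `translate_cds`) are hypothetical, and the code assumes the input is a complete coding sequence on the coding strand.

```python
from itertools import product

# Codon-to-residue assignments in TCAG order. Table 11 uses the same
# assignments as the standard code and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS, reading the start codon as M and stopping at a stop codon."""
    seq = clean(seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    residues = ["M"]  # any valid start codon initiates with methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCACCGTT CGTTACAACA GTTTGCAGAA"))  # prints: MHRSLQQFAE
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate_cds` would allow the GC content and Protein Length fields to be checked in the same way.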