Gene YpAngola_A2979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2979 
Symbol 
ID5801451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3139117 
End bp3140106 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content58% 
IMG OID641340821 
Producthypothetical protein 
Protein accessionYP_001607351 
Protein GI162421867 
COG category[S] Function unknown 
COG ID[COG3517] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03349] type IV / VI secretion system protein, DotU family
[TIGR03355] type VI secretion protein, EvpB/VC_A0108 family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.00000658806 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGTAC AGGAAAACCG TGCGCAGGGT ACTGCGACGA CCGTATTAAA GAATAGCCCT 
GCGGCACAAG GGGTTTACGC CGCCTTGTTT GAAAAAATCA ACCTGAGCCC GGTCTCCTCA
CTGGCCGGTA TCGAGGCGTT CCAGAACAAC GATGCGCTGG CGGAAGCCAC CACCGATGAG
CGCGTTACCG CCGCCGTCAG TGTCTTTTTA GACCTGCTGA AGCAGTCGGC GAAGAAAGTA
GAAAAACTGG ATAAAACCCT GCTGGACGGC CATATTGCCG CACTGGATGA CCAAATCAGC
CGCCAGTTGG ACGCGGTAAT GCACCACCCC GATTTCCAGC GGGTGGAATC GACCTGGCGT
GGTGTGAAGT CGCTGATCGA TCAGACCGAT TTCCGCCAGA ACGTGCGCAT CGAGCTGCTG
GATATCAGTA AAGATCATCT GGTGCAGGAT TTTGAAGATG CCCCGGAAAT CTCACAAAGC
GGTTTATACG CCCAGACCTA TATTCAGGAA TACGACACCC CCGGCGGCGA GCCGATTGCC
GCGGCTATCT CCAACTACGA TGTACAGGAT GATGCCTACT TCGTGTGGTG CCATTCGCCC
TTACAGGCGC ATTTTTTCAA TACCCTGGAT GCCGGTAGCC AGCTTTACGA GCGGATGCGC
GCCGTGTTGC GTGAGCCTGC GCCGGATCGC GCGGTACTGA CCTGTTTTCA CCGGGTATTG
ATGTTGGGCT TCCTCGGCGG GTACGCCTCC CCGGCGGCTT CTGAACGGGA GCAACTGATC
GATCAACTGA GCGTGCAGGT ACCGGCGTTC AGTGTCGCGC CATCACGTGG GATCCTGGCA
AGCGCCGCCT CCCGCAACCG GTTGGGGATC TGGTTGCGTT ACTGGCCGGT ACGGCTCGGG
CTGGCTGCGC TGATGGTTGC GCTGTTGTGG TGGGGCCTTG ATCACTGGCT GTCCGGTCTG
TTAGCCACCT TACTGCCGGA GCCGGTGTAA
 
Protein sequence
MSVQENRAQG TATTVLKNSP AAQGVYAALF EKINLSPVSS LAGIEAFQNN DALAEATTDE 
RVTAAVSVFL DLLKQSAKKV EKLDKTLLDG HIAALDDQIS RQLDAVMHHP DFQRVESTWR
GVKSLIDQTD FRQNVRIELL DISKDHLVQD FEDAPEISQS GLYAQTYIQE YDTPGGEPIA
AAISNYDVQD DAYFVWCHSP LQAHFFNTLD AGSQLYERMR AVLREPAPDR AVLTCFHRVL
MLGFLGGYAS PAASEREQLI DQLSVQVPAF SVAPSRGILA SAASRNRLGI WLRYWPVRLG
LAALMVALLW WGLDHWLSGL LATLLPEPV