Gene YpAngola_A2698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2698 
Symbol 
ID5801170 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2825871 
End bp2826896 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content54% 
IMG OID641340560 
Producthypothetical protein 
Protein accessionYP_001607098 
Protein GI162421796 
COG category[S] Function unknown 
COG ID[COG3520] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03347] type VI secretion protein, VC_A0111 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.010713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0127027 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACCCG ATGAATCCGT TCACCAGGAA AAACATGCCA ATCTGGACGC CTGGTATCAG 
GAGTCTCAGC CATGGAACGC CGGTTTTATC AGCATGATGC GGGCGAATGC CGCCCGCAAC
CCTACCCTGC CAGCGCCAGG AAAAGCGCCC TTGCCTGAAC AGGAAGCCTT CCGCATCGGG
CAAAGTGCGC ACATGACGTT CTCTCCGCGT GAGATATCTC ACGCGGTCAT GAGAGACGGG
AAAATGGATC TGCAACTTTT TGGCTTAGGG ATTTGGGGGC CAAATGGTGC GATGCCGCTT
CAGATGACCG AACTGGCCTA TACCCGTGCC GAGTTGCATG ATCATACGAT GACGGATTTC
GTCGACCTTT TTCATCACCG CGCATTATCA CAGCTTTATC GGGCGTGGTT TGTCTCTCAG
GATACCGCCT CGCTAGATCG GCAGAGTGAT GAAAAATTCT CCTTCTATGT CGGTAGTCTC
GCCGGGCTGG ATCCTCAAGA ACTTAATGAT ACCGAACTCC CGGTTCATGC CCGGTTAGCC
TCCTCTGCCC ATTTGATTCG TGAAACGCGC AACCCCGAAG GGCTCGTGGG CGCATTGCAG
TACTACTTTG ACGTCCCCGT GCGGATGGTG GAGTACGCCG AGCAGTGGAT CTTTCTGGAA
GAAAGCGACC AGACACAATT GGGTGATGGT GCAGGCGCGA TGCTCTTGGG CGACGGTGCT
ATTTTGGGCA ATACCGTTTT GGATCGGCAG CATAAATTTC AACTGATCCT CGGCCCCCTC
AGCCTGCAAC AGTACCTGCG CTTTAGCCTC TGGGGACAAG ACTTACCGGT ATTGCGAGAG
TGGGTACGTA ACTTTGTTGG GTTCGAATAC GCCTGGGAAG TCCAGTTGTT GTTAAGCGCC
GATGAGGTTC CCATGGCAAC ACTCGATGGC GGACATCAAT TGGGATACAC CTCTTGGTTA
GCCCGCAGTG ATACCACCCT TGATGTCGGT GGGATGAGTT TTGAGCCTGA AATGCATCAC
GATTAA
 
Protein sequence
MLPDESVHQE KHANLDAWYQ ESQPWNAGFI SMMRANAARN PTLPAPGKAP LPEQEAFRIG 
QSAHMTFSPR EISHAVMRDG KMDLQLFGLG IWGPNGAMPL QMTELAYTRA ELHDHTMTDF
VDLFHHRALS QLYRAWFVSQ DTASLDRQSD EKFSFYVGSL AGLDPQELND TELPVHARLA
SSAHLIRETR NPEGLVGALQ YYFDVPVRMV EYAEQWIFLE ESDQTQLGDG AGAMLLGDGA
ILGNTVLDRQ HKFQLILGPL SLQQYLRFSL WGQDLPVLRE WVRNFVGFEY AWEVQLLLSA
DEVPMATLDG GHQLGYTSWL ARSDTTLDVG GMSFEPEMHH D