Gene YpAngola_A0208 details

Gene Information

Locus tag: YpAngola_A0208
Symbol:
ID: 5798673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 223119
End bp: 224285
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 50%
IMG OID: 641338226
Product: lateral flagellin
Protein accession: YP_001604832
Protein GI: 162418505
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
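
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Expected residue count for a CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(223119, 224285)
print(length)                  # 1167
print(protein_length(length))  # 388
```

Both results agree with the Gene Length and Protein Length fields of this record.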


Plasmid Coverage information

Num covering plasmid clones: 41
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTGT CTATCCACAC CAATGCTTCT GCTAAAACAG CCATCAACAG CCTGAGTAAT 
GCGGGCCTGG CGAATGCCAA ATCTTCACAG CGCCTGTCTA CCGGTTTCCG CATTAACTCA
CCCGCCGATA ACGCGGCAGG CTTGCAGATC ACCAACCGTA TGGAGAAGTT TTTAAACAGC
GCGGGCCAGG CTAAGCAGAA CATTCAAGAA TCCATCGCCA TGCTGCAAAT CGCCGATGGT
GGCTTGGCTG AATCGGTCAA AACCCTGAAC GCCATGAAGA AGCTGGCAAC GCAGGCGGCG
AACGACACTA ACTCTGCGGC TGACCGCGAG GCTATCCAAA AAGAATTTAG CGAACTGGGT
AAAGAGCTGC AAAACGCGCT GAACAACACC GAATATAACT CCGAGAAGCT GTTTGCTGAT
GGCGGCAAAA TGCGTAAGGA ATTGAATTTC CAGAGCGGTA CCGATGCAGA ATCTAGCCTG
AAATTAGATC TGAATAGCGT GATTGCAGAG CTGACTGAGA GTGTGACCAA GCAGGCACCG
AAAATCACCG GAAAATCATC GAGCGCTACT GGGTCGTTGG AGAAGCAAGC TTATGATTTG
GATAAGGCAG TTACAGATAC TAAAAGTCTT GTAGCTGGTG CTGAAGGCGT CCAAAAGACT
CTGGAACACG ATTTTGCAGC TTCTGGTAAT AAAGCAGTGG CAGAAATTAA GATCCCTGAA
TATAAGGATG CGCTGGGTAA GACCGTCCCA GAGGTGGTAA TTGCTCTTGG TGCCGTTATC
ACTTCAGCCA ATAGCAACCA GATGAAGGAT GCGGTTGCTG CCCTGAAAAC AACACATGAC
GCTGCGGTCA AGGCAGAAGC CACATTCCAG GCTAAAAACT CTACCGGTGG TGGTGTGATG
AATATGCAGC TGGCCGATAA AGATCTGGCG ATGAAGGCAG ATAAAAAGCT GTCAGACGTG
ATTGACGCCT ATGGTGCTTT CCGTGCCACG CTGGGGGCGA ACCAAAACCG CCTGCAATCC
TCTTCCAATA ACCTGGATAA CATGATCAGC AACACCGCGC AGGCGCTGGG CAGCATCAAA
GATACCGATT TTGCGAGCCT GTCCATAATT CTGTGTAACT GCCACCGTAT TAAAGGTGAT
CGCTCAGGCG GTCACCGAAC TCGATAA
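
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The sketch below shows one way to do that, assuming the sequence is pasted as it appears above, in space-separated blocks of ten bases. The helper names are hypothetical. Only the first row of the gene sequence is used here, so the printed percentage describes that row alone and not the whole 1167 bp gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence shown above (60 bases).
first_row = "ATGTCACTGT CTATCCACAC CAATGCTTCT GCTAAAACAG CCATCAACAG CCTGAGTAAT"
print(len(clean_sequence(first_row)))
print(round(gc_content(first_row), 1))
```

Passing the full gene sequence to `gc_content` instead of `first_row` would give the whole-gene value for comparison with the GC content field.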
 
Protein sequence
MSLSIHTNAS AKTAINSLSN AGLANAKSSQ RLSTGFRINS PADNAAGLQI TNRMEKFLNS 
AGQAKQNIQE SIAMLQIADG GLAESVKTLN AMKKLATQAA NDTNSAADRE AIQKEFSELG
KELQNALNNT EYNSEKLFAD GGKMRKELNF QSGTDAESSL KLDLNSVIAE LTESVTKQAP
KITGKSSSAT GSLEKQAYDL DKAVTDTKSL VAGAEGVQKT LEHDFAASGN KAVAEIKIPE
YKDALGKTVP EVVIALGAVI TSANSNQMKD AVAALKTTHD AAVKAEATFQ AKNSTGGGVM
NMQLADKDLA MKADKKLSDV IDAYGAFRAT LGANQNRLQS SSNNLDNMIS NTAQALGSIK
DTDFASLSII LCNCHRIKGD RSGGHRTR
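
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table, assuming the amino acid assignments of translation table 11, which match the standard code for internal codons. It does not handle the alternative start codons that table 11 permits, which is acceptable here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in TCAG order paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGTCACTGT CTATCCACAC CAATGCTTCT"))  # MSLSIHTNAS
print(translate("CGCTCAGGCG GTCACCGAAC TCGATAA"))     # RSGGHRTR
```

The two outputs match the first ten and last eight residues of the protein sequence listed above. The second fragment is in frame because the 1140 bases that precede it are a multiple of three.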