Gene YpAngola_A2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2007 
SymbolflgK 
ID5800477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2093549 
End bp2095213 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content50% 
IMG OID641339928 
Productflagellar hook-associated protein FlgK 
Protein accessionYP_001606478 
Protein GI162421752 
COG category[N] Cell motility 
COG ID[COG1256] Flagellar hook-associated protein 
TIGRFAM ID[TIGR02492] flagellar hook-associated protein FlgK 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAATA GTTTAATGAA TACGGCAATG AGTGGTTTGA ACGCAGCGCA ATATGCCCTG 
AGCACCGTCA GTAACAACAT CACCAACTTT CAAGTAGCCG GTTATAACCG CCAGAACACG
GTCTTTGCCC AAAATGGTGG CACCATTACC TCTGCCGGTT TTATTGGTAA CGGGGTAACG
GTCACCGGGG TTAATCGCGA ATATAACGCT TTTATTACCA ACCAACTGCG CGCCTCTCAG
ACACAAAGCA GTGGCTTGGC CACCTATTAT CAGCAAATTT CGCAAATCGA CAATCTGTTG
TCGAACGCCT CGAATAATCT CTCCACCACC ATGCAGGATT TCTTCAGCAA CCTACAAAAC
CTGGTCAGTA ATGCTGATGA CGATGCTGCG CGCAAAACCG TATTGGGTAA AGCCGAGGGG
TTAGTGAACC AATTCCAGAA CGCAGATAAA TATCTGCGGG ATATGGATGA CGGCGTTAAC
CAAAAAATTA CCGACAGCGC GACGCAAATT AATAACTATG CCGAACAAAT CGCTAAGTTA
AACGACCAGA TCACCCGTCT GCGGGGCAGC AGCGGTAGCG AGCCCAATGC CTTGCTCGAC
CAACGTGACC AACTGGTGAC AGAGTTGAAC CAGATTATGG CGGTTACGGT CACCCAGCAA
GATGGCGATG CTTATAACGT CTCGTTTGCT GGTGGCTTGT CGCTGGTACA AGGACCGAAT
GCCTATAAAG TGGAAGCTAT TCCCTCCAGT GCGGATGCGA CGCGCTTAAC TCTGGGCTAC
AAACGGGGTA ACGGCGAGGC AACGGAGGTA GATGAGAGCC GTATTACTAC CGGGTCGCTT
GGCGGCACCT TGAAATTCCG CAGTGAAGCA CTGGACAGCG CCCGTAACCA GCTAGGTCAG
TTGGCGTTAG TTATGGCTGA TAGCTTCAAT ACGCAGCACA ACGCCGGGTT TGATATCAAT
GGTGATGAGG GGGAGGACTT CTTTAGTTTC GCCGACCCTA CCGTGCTGAA AAATGCCAAA
AATCAGGGTA ATGCCAGCAT CACGGTGGAA TATAAAGACA CATCGAAAGT GAAAGCCAGT
GATTACACCG TAGAATTTGA TGGTACTGAT TGGCAGGTGA CCCGTCTGTC CGATAATACC
AAAGTGCAAA CGACGCCCGG AGTGAATGCT GACGGTGATC CTACGTTGGA ATTCGAGGGT
GTCGCTATCA AGATCGATAA CGGCACGCCC GGCCCGCAGG CGAAAGATAA ATTTACCATT
AAGACCGTGA GTAACGTGGC GGCCAACTTA CAGGTTGCTA TTACCGATTC CAGCAAGATT
GCCGCTGCGG GTAGTGCCGA TGGCGGTATC AGTGACAACA CCAATGCGCA GGCGTTGCTG
GATTTGCAGA GCAAAAAGTT GGTCGAAGGT AAAACCACCT TGTCCGGTGC TTACGCCGGT
TTAGTCAGTA ACGTTGGTAA CCAGACCGCG ACGGCTAAAA CCAACAGCAC TGCACAGGCG
AATATCGTCA CCCAATTAAC CACCGAGCAA CAGTCAATCT CTGGGGTGAA TCTGGATGAA
GAGTACGGCG ATCTACAACG TTTCCAGCAA TATTATTTGG CGAATGCACA GGTCCTGCAA
GCGGCGTCTA CGTTGTTTAA TGCCTTGCTC AGTATAAGTG ACTAA
 
Protein sequence
MSNSLMNTAM SGLNAAQYAL STVSNNITNF QVAGYNRQNT VFAQNGGTIT SAGFIGNGVT 
VTGVNREYNA FITNQLRASQ TQSSGLATYY QQISQIDNLL SNASNNLSTT MQDFFSNLQN
LVSNADDDAA RKTVLGKAEG LVNQFQNADK YLRDMDDGVN QKITDSATQI NNYAEQIAKL
NDQITRLRGS SGSEPNALLD QRDQLVTELN QIMAVTVTQQ DGDAYNVSFA GGLSLVQGPN
AYKVEAIPSS ADATRLTLGY KRGNGEATEV DESRITTGSL GGTLKFRSEA LDSARNQLGQ
LALVMADSFN TQHNAGFDIN GDEGEDFFSF ADPTVLKNAK NQGNASITVE YKDTSKVKAS
DYTVEFDGTD WQVTRLSDNT KVQTTPGVNA DGDPTLEFEG VAIKIDNGTP GPQAKDKFTI
KTVSNVAANL QVAITDSSKI AAAGSADGGI SDNTNAQALL DLQSKKLVEG KTTLSGAYAG
LVSNVGNQTA TAKTNSTAQA NIVTQLTTEQ QSISGVNLDE EYGDLQRFQQ YYLANAQVLQ
AASTLFNALL SISD