Gene YpAngola_A2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2022 
SymbolfliD 
ID5800492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2110148 
End bp2111548 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content47% 
IMG OID641339942 
Productflagellar capping protein 
Protein accessionYP_001606492 
Protein GI162420174 
COG category[N] Cell motility 
COG ID[COG1345] Flagellar capping protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAGTA TCAGTGCATT AGGTACCGGT TCAGGACTGG ATTTAAATAC ATTGCTGTCA 
CAATTATCGG CCGCAGAACA GACACGCCTT ACCCCGCTAA CCACGCAGCA AACCAGTTAT
AAAAGCAAAC TTACCGCTTA TGGTGTTTTG CAAAGTGCGT TGGCAAAGCT TGAAACCGCG
TCTACCGCCC TGAAAAAAGC TGACACGTTG AACTCTACGG CAGTCAGCGG CAGCAATTCC
GCATTCAGCG CGACAACGGA TAGTGCGGCC AGTGCAGGTA CGTACAGTAT TGAAGTCACC
AATCTGGCCA AAGCACAATC GCTGTTATCG GCAGATGTGC CTAGCGCTAC AGATAAATTG
GGTAGCAGTG ATGCAACACG CACGATCACG ATTACTCAAC CCGGCCAAAA AGAGCCGATG
AAAATCAGCC TGACCAGCGA GCAAACTTCG CTGACGGGTA TCCGTGATGC TATTAATAAG
CAGGAAGGCA GTGTCAACGC CAGTATTATG AAAGCAGATG ATAATACTTA TTATCTGGCA
TTAACGTCGA AGGATACCGG TACGCAATCA GAAATGACCA TCAGTGTAGC GGGTGATGAA
ACCTTAAATA ATTTCCTGAA CTATACCCCC AGCAGCACCG GCGGTAGTGG TGCGCTGACG
CAAAAAGTCA AAGCGGAAGA CGCCACCTTG AGTGTTAATG GTGTCAGTAT TACCCGCCAA
AGTAATACCA TTACCGATGC ACCTCAGGGC GTCACCATCA ATCTGAAAGC CGTCACCAAA
GAGGGCGCGC CAGAGCAACT GACTATTGTC CGTGACAACA CGGCCACAAA GGCGGCGATC
CAAAGTTTTG TCGATGCCTA TAACTCATTA CAAACCACCT TCGGCTCACT GACCAAATAT
ACCGTTGTAG AGACAGGCCA GGATCAATCC ACGAGTAATG GTGCATTAGT CGGGGATGGC
ACATTGCGTT CTATCCAGAC GCAATTAAAG AGCCAATTGG CCTCTAGTCA ATCTGGGGAT
TTGAAAACCT TAGCCAGCAT CGGGATCACG CAAGATCTCG ACGGTAAGTT GGTTATCAAC
GCCGATAAAT TAAATACTGC ACTCACCGAT AAACCCAATA GCGTCACTGC ATTTTTTGTG
GGTGATGGGG AAACCACGGG TTTTGCTACT CAGACTGAAA AGTTGCTCAA TACCGCATTG
GATACGACAT TAGGGACATT AAAAACAGCC ACTGACGGTA TTAATACTTC ACTAAAAAAC
CTGGATAAAC AAATTGCGGC AACGACTGCC AGTATAGAAA CAGCCATTGA GCGCTATAAA
ACACAATTTA CTCAATTGGA TAAGCTGATG ACATCAATGA ACAGTACAGC CAGTTTCTTA
ACACAACAAT TTGATTCATA A
 
Protein sequence
MASISALGTG SGLDLNTLLS QLSAAEQTRL TPLTTQQTSY KSKLTAYGVL QSALAKLETA 
STALKKADTL NSTAVSGSNS AFSATTDSAA SAGTYSIEVT NLAKAQSLLS ADVPSATDKL
GSSDATRTIT ITQPGQKEPM KISLTSEQTS LTGIRDAINK QEGSVNASIM KADDNTYYLA
LTSKDTGTQS EMTISVAGDE TLNNFLNYTP SSTGGSGALT QKVKAEDATL SVNGVSITRQ
SNTITDAPQG VTINLKAVTK EGAPEQLTIV RDNTATKAAI QSFVDAYNSL QTTFGSLTKY
TVVETGQDQS TSNGALVGDG TLRSIQTQLK SQLASSQSGD LKTLASIGIT QDLDGKLVIN
ADKLNTALTD KPNSVTAFFV GDGETTGFAT QTEKLLNTAL DTTLGTLKTA TDGINTSLKN
LDKQIAATTA SIETAIERYK TQFTQLDKLM TSMNSTASFL TQQFDS