Gene YpsIP31758_2281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2281 
Symbol 
ID5386178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2584748 
End bp2585857 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content47% 
IMG OID640865268 
Productflagellin 
Protein accessionYP_001401250 
Protein GI153950689 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGGTAA TTAACACAAA CAGCTTGTCC CTGCTGACTC AGAACAACTT GAATAAATCT 
CAGTCTTCTT TAGGCACCGC CATTGAGCGT TTGTCTTCCG GTCTGCGTAT CAACAGTGCA
AAAGACGATG CCGCTGGTCA GGCGATTGCT AACCGTTTCA CCTCTAACAT CAAAGGCCTG
ACTCAGGCTG CACGTAACGC CAACGACGGT ATCTCTATCG CTCAGACTAC TGAAGGTTCT
CTGAACGAAA TCAACAACAA CTTGCAGCGT GTACGTGAAC TGACTGTACA GGCGCAAAAC
GGTTCGAACT CAAGTTCTGA CCTGGACTCT ATTCAGGATG AAATCAGCCT GCGTTTGGCT
GAAATTGACC GTGTATCTGA TCAGACCCAA TTCAACGGTA AAAAAGTCCT GGCTGAAAAC
ACCACAATGT CGATTCAGGT TGGTGCAAAT GATGGCGAGA CCATTGATAT CAACCTGCAA
AAAATCGACT CTAAGAGCCT GGGCTTAGGT AGCTACTCTG TTAGCGGTGT ATCTGGTGCA
TTAACCTCAT TAACTGATAC ATCAGTAACA GGTGTAACTA CCACGACCGC GCTTGATTTT
AGCGACATTA GCACTTTTGC TAAAGGTGCC ACAGTACATG GTATTGGGGA CGTCGGCACA
GATGGCGCTT ATGCAGACGG TTATGTTATC CGTACTACTG ACGGTAAACA ATATAAAGGT
GAAGTAGATG CTACTAATGG AAAAGTAACA TTTGCAGATG ATGCTAATGG CGATCCAATT
GATGATGCTA CCAAGCTGGA AGCTGCGGCT CAGTTTAGTC CTGCTGGCAA AGCAACGGCT
TCACCATTGG AGACTTTGGA TGATGCTATC AAACAAGTTG ATGGCCTTCG TAGTTCACTG
GGTGCGGTAC AAAACCGTTT CGAATCTGCG GTCACCAACC TGAATAACAC CGTGACTAAC
CTGACTTCTG CCCGTAGCCG TATCGAAGAT GCTGATTACG CGACTGAAGT GTCTAACATG
AGCCGTGCTC AGATCCTGCA ACAAGCAGGG ACTTCTGTGT TGTCTCAAGC TAACCAAGTT
CCACAGACTG TTCTGTCTCT GCTGAATTAA
 
Protein sequence
MAVINTNSLS LLTQNNLNKS QSSLGTAIER LSSGLRINSA KDDAAGQAIA NRFTSNIKGL 
TQAARNANDG ISIAQTTEGS LNEINNNLQR VRELTVQAQN GSNSSSDLDS IQDEISLRLA
EIDRVSDQTQ FNGKKVLAEN TTMSIQVGAN DGETIDINLQ KIDSKSLGLG SYSVSGVSGA
LTSLTDTSVT GVTTTTALDF SDISTFAKGA TVHGIGDVGT DGAYADGYVI RTTDGKQYKG
EVDATNGKVT FADDANGDPI DDATKLEAAA QFSPAGKATA SPLETLDDAI KQVDGLRSSL
GAVQNRFESA VTNLNNTVTN LTSARSRIED ADYATEVSNM SRAQILQQAG TSVLSQANQV
PQTVLSLLN