Gene SbBS512_E4199 details


Gene Information

Locus tag: SbBS512_E4199
Symbol: lpfD
ID: 6269605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3925123
End bp: 3926196
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 39%
IMG OID: 641728019
Product: long polar fimbrial operon protein LpfD
Protein accession: YP_001882440
Protein GI: 187731851
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3539] P pilus assembly protein, pilin FimA
TIGRFAM ID:
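
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a terminal stop codon; the function name `expected_protein_length` is hypothetical and not part of any database API.

```python
def expected_protein_length(gene_length_bp, has_stop_codon=True):
    """Return the protein length in residues implied by a CDS length in bp.

    Assumes an unspliced CDS whose length is a whole number of codons.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


# Values copied from the record above.
start_bp = 3925123
end_bp = 3926196

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1
protein_length_aa = expected_protein_length(gene_length_bp)

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1074 bp) and Protein Length (357 aa) fields of the record.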


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGT ATATCAAACA GTGGTGCTTT GCTGTGTTTA TGCTCTCGTT AAGTAGTGTA 
GCACTTGCAG CTCCTAAAGG TATCTGTACC CCTGATAATG GCGTATTTCA CAGTACGCTT
GATTTCTCGG GATATCTTAT TACCGCGAAC GAAAATAAGG TTGGAACGAC ATTTAACACC
ACTGTAACTA ATGGATCCTC ATATCCTGGA CGCTGTCATT GTGATACTGG TAACGTAGGA
GAATTCCCCT ATATCTATTA TACGTCAAAA ATAAACCAGG CATTAACTTA TGCTGGGGTT
CACTCTAATA TTAATTATTA TGATCTTAAT CCGAACCTTG ATGTTGGGAT TGCTATAGAT
ATTCTTGGGG TTGGGTATGT GAATGCACCT TTTGAATATC ATGCTAACAA CCCGTCTGGT
AATACAAAGT ACAATTGCAA CCGCATTGAA CCTTTAAGTA TATCTAGTGG GGCTAAAGCG
ATAGTATATT TTTATATCAA GAAGACATTT GCAGGAAAAT TGATTATTCC TGAAACAAAG
ATAGTGACAT TGTATGGAAC AATTAGCCGT GACACCCCGG TGGATTACTC ACAGCCGATG
GCTGATGTTT ATATTCGGGG CGATATTACA GCTCCGCAAA GTTGCGAAAT AAATAATTTA
CAGCCAGTTT ATTTTGATTT TAAAGAAATA CCAGCTGCAG ATTTTTCATC TGTTGTTGGA
AGTGCGGTAA CAACACATAA AATTACTAAA ACGGTCACTA TTGAGTGTGA AAACCTGGGG
ATACTGAATA CCGATGATAT CAGTACGTCT TTTTATGCTA CCGAACCCAA TACGGACAAT
TCAATGGTCG TAACTTCAAA CTCTAACGTG GGGATAAAAA TTTACGACAA AAATAATAAG
GAAATCAAAG TGAACGGTGG TGAGTTGCCA ACAGACATGG GTAAATCAAC AGTGTATGGT
GAAAAATCAG GTAGTGTAAC TTTTTCAGCT GCTCCTGCGA GCCTTACTGG AGCTCGCCCA
GCCCCGGGGC AATTTACTGC AACAGCGACG ATAACTGTTG AGATTGTGCG CTAA
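
The gene sequence is printed in space-separated blocks of 10 bases, so the spacing has to be removed before computing statistics such as GC content. A minimal sketch, using only the first row of the sequence as input; the helper names `clean` and `gc_fraction` are hypothetical:

```python
def clean(seq_text):
    """Remove the spaces and newlines used for block formatting."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied as displayed.
first_row = "ATGAACAAGT ATATCAAACA GTGGTGCTTT GCTGTGTTTA TGCTCTCGTT AAGTAGTGTA"

print(len(clean(first_row)))
print(f"{gc_fraction(first_row):.1%}")
```

Passing the full gene sequence to `gc_fraction` in the same way gives a value that can be compared with the GC content reported in the Gene Information section.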
 
Protein sequence
MNKYIKQWCF AVFMLSLSSV ALAAPKGICT PDNGVFHSTL DFSGYLITAN ENKVGTTFNT 
TVTNGSSYPG RCHCDTGNVG EFPYIYYTSK INQALTYAGV HSNINYYDLN PNLDVGIAID
ILGVGYVNAP FEYHANNPSG NTKYNCNRIE PLSISSGAKA IVYFYIKKTF AGKLIIPETK
IVTLYGTISR DTPVDYSQPM ADVYIRGDIT APQSCEINNL QPVYFDFKEI PAADFSSVVG
SAVTTHKITK TVTIECENLG ILNTDDISTS FYATEPNTDN SMVVTSNSNV GIKIYDKNNK
EIKVNGGELP TDMGKSTVYG EKSGSVTFSA APASLTGARP APGQFTATAT ITVEIVR
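
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds a codon table from the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in which codons may act as start codons), and translates the first and last rows of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical; this is an illustration, not the method used to generate the record.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position),
# paired with the amino acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text):
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    seq = "".join(seq_text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First and last rows of the gene sequence, copied as displayed.
# Each full row holds 60 bases (20 codons), so every row starts in frame.
first_row = "ATGAACAAGT ATATCAAACA GTGGTGCTTT GCTGTGTTTA TGCTCTCGTT AAGTAGTGTA"
last_row = "GCCCCGGGGC AATTTACTGC AACAGCGACG ATAACTGTTG AGATTGTGCG CTAA"

print(translate(first_row))  # MNKYIKQWCFAVFMLSLSSV
print(translate(last_row))   # APGQFTATATITVEIVR*
```

The two outputs match the first 20 and last 17 residues of the protein sequence above, with the trailing `*` corresponding to the TAA stop codon.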