Gene YpAngola_A1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1043 
Symbol 
ID5799506 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1068355 
End bp1069554 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content46% 
IMG OID641339031 
Producttype IV pilin biogenesis protein 
Protein accessionYP_001605603 
Protein GI162421450 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0343376 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCCGCC ATCGTTTATT CAATTGGACA GCTCTCAACA AAACAGGGGA GCTACAGACG 
GGCATGCTAC TGGCAACTGA GAGAAACAGT GTCTATGAAC ATATCATCCA ACATGGCTTA
CAGCCCCTAG GCGTGAAAGG GGGAAGGCGG TTATCTGCAC GCTACTGGCA AGGGGAACGG
TTGGTGGCAA TGACCCGCCA ATTAGCGACT TTATTGCAAG CGGGTTTACC GTTGGTCAAC
AGTTTACAAT TATTGGCAAA AGAGGCAGAT GACTCAGCAT GGCGTTGCCT ATTAGATGAG
ATAAGTCAGC AAGTTGCACA AGGCCAGTCA CTCTCGGAAG TGATGGAACA GTATCCACAC
GTATTCCCCC GACTGTATCC TCCAGTGGTT GCCGTCGGGG AGCTTACGGG TAATCTTGAG
CAATGTTGTA CTCAATTAGT ACACCATCAG GAACGGCAGC AAAATTTACA CAAAAAAGTC
ATAAAAGCGC TGAAATACCC CGTTGTGGTC TGCATCGTCG CATTGGTAGT CAGTGTCATT
ATGTTAGTCA TGGTGTTACC CGAATTTGCA CAAATATATC AATCGTTTGA TACCCCCCTG
CCGGGGCTAA CTGCAAGCTT ACTGTGGCTA TCAACTTTTC TCACTTTTTA TGGCCCCTAT
CTGGCGCTGA TAATAGCAAT AGTGTGTATT GGGTATTTCT ATACATTACG AAAAAAATCT
CGCTGGCAGC AATGGGAACA GACCATTCTA TTAAGCATTC CCTTAGTCTC AACATTAATC
CGTGGTAGCT GCCTCAGCCA AATTTTTCAA ACGTTAGCTA TCACACAACA AGCCGGGCTA
CCCCTGTCAG CCGGGTTAGA TGCAGCGGCT CGATCCATTC ACAACTACAA TTATCAGCAA
GCCTTAAGGT GTATTCAAAA ACAAATTAGC CAAGGTATAC CGCTGTATAC CACTCTTAAT
CAGCACCCTT TATTTCCTGC CATTTGTCAG CAGCTCATTA GGGTTGGTGA AGAATCAGGC
TCACAGGATG TGTTGCTGGA AAAGTTAGCC TGTTGGCATC AGCAACAAAC TCAGAATTTG
GCAGATAACG TCACTCAAAT GCTAGAGCCA CTTCTTATGC TGATTATTGG CAGTATTGTC
GGCGTACTGG TGATCGCCAT GTATTTACCC ATATTCCAGT TAGGTGATGT TATTGGATAA
 
Protein sequence
MSRHRLFNWT ALNKTGELQT GMLLATERNS VYEHIIQHGL QPLGVKGGRR LSARYWQGER 
LVAMTRQLAT LLQAGLPLVN SLQLLAKEAD DSAWRCLLDE ISQQVAQGQS LSEVMEQYPH
VFPRLYPPVV AVGELTGNLE QCCTQLVHHQ ERQQNLHKKV IKALKYPVVV CIVALVVSVI
MLVMVLPEFA QIYQSFDTPL PGLTASLLWL STFLTFYGPY LALIIAIVCI GYFYTLRKKS
RWQQWEQTIL LSIPLVSTLI RGSCLSQIFQ TLAITQQAGL PLSAGLDAAA RSIHNYNYQQ
ALRCIQKQIS QGIPLYTTLN QHPLFPAICQ QLIRVGEESG SQDVLLEKLA CWHQQQTQNL
ADNVTQMLEP LLMLIIGSIV GVLVIAMYLP IFQLGDVIG