Gene YpAngola_A0250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A0250 
Symbol 
ID5798714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp260986 
End bp262092 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content49% 
IMG OID641338264 
Productfimbrial family protein 
Protein accessionYP_001604870 
Protein GI162419400 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3539] P pilus assembly protein, pilin FimA 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.319484 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.000123535 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAA ATATATTCCT CACGTTACGC AGTGAGAGGT ATAAGGCGGG ACTGCTGTTG 
TTGCTGCTAC TCCTGTTATG TACCACCAGT AAACCGAGTT TTGCCGCACT GCAAGTTAAC
TCTTGTTCTC TTGCTGTATC GGGGGCCGGA GGGGAAACAC CGTTTGTCAT AAAAGCACTG
CCAGTCACGT TGACGAATGG TTTTGTCTTG GGGAGCATGA ACTTCAGTGT CGTCACGACA
TTCACAAAAA CAGGGACGAC AGCAGATTAT TTATACCTCG GTGGTTGGTC GATGCTTGCT
GGGGGGGCGG TTATGCCTGT TTCTACCGGT GTTCCAGGGG TGAAGATGTC GGTGAATGTC
AGTGACGCTT TATTCGGGGG ATATGAATGG GGGAATACAG AGAAGGCAAC TCAGTCCTTG
TTTGTACCAG GGACAAAAAT AACGATTGCT AATCGAGTCA CGGTCCAACT CATTGTGACT
GATGCCAGTG TTTATCAAGG GGGAGTGATT AATTGGTTTA ATGTGGCGAG AAGTTTTGCG
TTAGTTTCAG GTAATGCTCA GGTCTGGTCG ACTAAAAGTG TTGCTACTGG TGGCTCACGT
TGTGGGGATA TGGTGACTTT TAATATCGGG TCAACAAATA TCGTATTGCC ACCTCCCACT
GTGGCGACGT GTGATCTGGG GGCGACCGAT ATTGTGGTTG CACTGGACCC GATTGATACC
TCTTCCCTAC AAACGCAAGG TGACCGGGCG GGCGGACAAG CTTTTTCGAT CCCACTGGGC
AGTTGCGCTA AAGATGCTAA ACCCTATATC ACTTTTACCG ACAGTAGTAA CAAGGCTAAC
CGTACCAATA TACTGAGTTT ATCACCGAGC AGCACGGCGA CCGGTGTGGG CCTTGTGCTT
GAAAAAAGTG ACGGCAATCT GGTGACATTT GGCGCAGAAA ACGCCAGTGT CAGCGCCAGT
AATGTGGGGC AATTTTTGAT TGGTACCTCA ACTGCCGCAG GAGGGAGCAT GCCTTTAAAT
CTAACGGCTC GCTATATTCG CGCTGAAGGG GTGTTAAAGA GCGGTAATGT GAAAGCGGAT
GCAATCTTTA CCGTCGCCTA CCCTTAA
 
Protein sequence
MKKNIFLTLR SERYKAGLLL LLLLLLCTTS KPSFAALQVN SCSLAVSGAG GETPFVIKAL 
PVTLTNGFVL GSMNFSVVTT FTKTGTTADY LYLGGWSMLA GGAVMPVSTG VPGVKMSVNV
SDALFGGYEW GNTEKATQSL FVPGTKITIA NRVTVQLIVT DASVYQGGVI NWFNVARSFA
LVSGNAQVWS TKSVATGGSR CGDMVTFNIG STNIVLPPPT VATCDLGATD IVVALDPIDT
SSLQTQGDRA GGQAFSIPLG SCAKDAKPYI TFTDSSNKAN RTNILSLSPS STATGVGLVL
EKSDGNLVTF GAENASVSAS NVGQFLIGTS TAAGGSMPLN LTARYIRAEG VLKSGNVKAD
AIFTVAYP