Gene YpAngola_A4128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4128 
Symbol 
ID5802608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4415170 
End bp4416561 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content38% 
IMG OID641341904 
Producthypothetical protein 
Protein accessionYP_001608409 
Protein GI162418997 
COG category 
COG ID 
TIGRFAM ID[TIGR01731] adhesin HecA family 20-residue repeat (two copies)
[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0784897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.299338 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTGA CATATATTAA AACGAAACAG GGGTTGTCTA TATTACCTCT CTCTATCATA 
TTGTCGCTCT ATGGGAGCTC GGTTGCTTAT GCTGATAATA TTATTCCTGA TATACAGGCC
CCAGCGGGAC AGCAAGCGGA AGTCTCTATT ATAGAAACAC CGCCCAGTAC ATGCCGAGCG
CTTACCTCTT ACTGTATCGG TATGACAGAA ACTGTCGTTA ATATTCAAGC GCCTGATGAA
AATGGGTTAT CACATAACAA GTATTCTAAA TTTGATGTGG TCGCTAATGG CTTGTTCGAT
GTCACGACAC TCAATAATCG CTTAGCTCAA GAGGTTAATG GCAACTCTTT TTTACAAGAT
AAATCGGCAA CCATTATATT AAATGAAGTC AACTCATCAC ATGCCAGTCT ATTAGATGGG
AATCTCCGTG TTGACGGGGG AAATGCGCAT ATTATTATTG CCAATCCAGC AGGTATTAAT
TGTCGAGGGT GCTCTTTTAC TAATGCCTCT CATGTGACAT TGACGACAGG TACGCCATCT
TTTTCTGATA ATAAACTAAA CAGTTTTATT GTTGAGCAGG GAAATATCAA TATTGAAAAA
AATCCCTCTT ACTATATGAG AACAGGTTTA CGGTATAAGG GGGCAGACAA GACTTATCTT
GATCTATTCG CAGACACTAT TACGGTTAGT GGTGAGATCA ATGCGGACGA TGGCTATATT
GTCACGGGAA AAAATAAAGT GGGTTTCTCT TTGCCTGGGC AACCATTGCA AGTGTCGCGT
TTAGGCAATG AAAATACACC AGTACCAGGT ACGGTTAGTT TGGATGTCAG CGAAATTGGG
GGAATGTACA CCAATAAAAT TCGTATCCAT GCAACCGATG GCACGATTAA AAATAAGGGG
GCAATACGCG CGAACGATAC ACTAAGCCTC AGTTCTGCAG CCAATATAGA TAACAGTAAT
GGGAATATAT CAGGGAAAAT GGTGTTACTG AGTAGCGAGG GTGTTATAAA TAACTCTGGT
GGCACAATAT TAAATAATGG TGAGTATGAT TTATTACCTT CTCAAGGTAT TAAAATAACA
TCTCGTGGCT TAAATAATGA AGGTGGAAAA ATAGAGTATA AAAATGGTAG CGTTGAAATA
GCAACAGTTA ACACCATCAA AAATGGTAAA GGTACAATTA AAGCAACATC AACGCAAGGG
CGGGTAAGAA TAAAACTTCA TAGTAATAAC CTTAATAATA CTGGAGGGAG TATTATTTCT
TCAGGGAAAA TAGAGGGTAA AGTTAATAAC ATACGAAACA ATGGAGGGGC TATTATTGGG
TTAGGTGGAG TGGATTTGAA TGAAACTGTT TTAATTAATG ATACCGGTAA AATAATTTCT
GATTTTAATT AA
 
Protein sequence
MKLTYIKTKQ GLSILPLSII LSLYGSSVAY ADNIIPDIQA PAGQQAEVSI IETPPSTCRA 
LTSYCIGMTE TVVNIQAPDE NGLSHNKYSK FDVVANGLFD VTTLNNRLAQ EVNGNSFLQD
KSATIILNEV NSSHASLLDG NLRVDGGNAH IIIANPAGIN CRGCSFTNAS HVTLTTGTPS
FSDNKLNSFI VEQGNINIEK NPSYYMRTGL RYKGADKTYL DLFADTITVS GEINADDGYI
VTGKNKVGFS LPGQPLQVSR LGNENTPVPG TVSLDVSEIG GMYTNKIRIH ATDGTIKNKG
AIRANDTLSL SSAANIDNSN GNISGKMVLL SSEGVINNSG GTILNNGEYD LLPSQGIKIT
SRGLNNEGGK IEYKNGSVEI ATVNTIKNGK GTIKATSTQG RVRIKLHSNN LNNTGGSIIS
SGKIEGKVNN IRNNGGAIIG LGGVDLNETV LINDTGKIIS DFN