Gene YpAngola_A1072 details

Gene Information

Locus tag: YpAngola_A1072
Symbol:
ID: 5799535
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 1099760
End bp: 1101139
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 48%
IMG OID: 641339057
Product: hypothetical protein
Protein accession: YP_001605629
Protein GI: 162419266
COG category:
COG ID:
TIGRFAM ID:
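
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1099760, 1101139)
print(length)                           # 1380
print(expected_protein_length(length))  # 459
```

Both values agree with the Gene Length and Protein Length fields listed for this gene.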


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAATA AAAGAAAAAT TGTCCTCGCC GCTATATTAC TCACGGTTAA CGGCGGCTTG 
TTCGCCCAGA GCGTCGTGAT TGATCAATTA AAAGTTTCTG AGCATCTTTA TCCAAAGGGA
TTTGAATCCG AATTTCAGAA TAACCTCGAT TTCTATCGTG ACGGTGTTGG TGTCGATGAG
AAGGCCAAAG TGCCTTACGA TACGATCCGT ATTGAAGGTA CTGAAATTGT TAAAGGTTAT
TATACCAACA CCACCGAGGT GGGGTTGTAT CTCAATATAT TAACCGAGTC GGTAAAAGCC
GGTAACCTGC AGGCGTTACA ACGAATTAAA GAGACATTAA CCACCTTGGA GCAGGCCCCC
AAATGGAACG GGTTATTTTA CTGGCCCTAT GATATCCGCG ACGGCAAGCT GGTGACCAAT
CCCGATGAAA TCGTACCTGC GGTGGATAAC GGTAATCTCT CGTTCGCACT GGCGGGGGTT
GCGGGTGCAT TTCTAGATTC GAGCGATGCG GACAAGCAAG AGATCGTGCA ACGCATTGAG
GCGATACTGG ACGGACAGAA ACCAGGCTGG GCCGCCCTGT ACGATGAAAA TAAAGGTCTG
CTCTCCTCTG GTTGGTCGAC AAAAAACAAT GCGTCACTGG GCTACTTCGT TGATCGCAAG
GGCAATGAAA GCCGTGCGGC GGTGGCCTGG GCGGTGCTGG CGACTAAAGA TATGGGAGCC
AAAGCATTAC CGGTTAGCGC GTTCAGTAAA ATGGAGCTCT ACACCCAACG CTATGAAATA
AACGGCAAGC AATACAACCC GCTGCTGACC TGGGATGGCG CTTATTTCCA GATGATGATG
CCGCAAATAT GGCTGAATGA GCGTGAACTG ATGCCTAACT ACGGCATTGT CGAGGATCAC
ACCTTTATTC AAAAAGTCTA TGCCAGCAAG CATGGCATTC CAATGGTTTC TTCCTCCGCT
ACCACGGATA ACGCTTACCA CGCCTTCGGT GTGCCACAGC TTTCCGAGAG CAAAGTTCGC
TTCAAGAATA AGATCGATGA TGGCTATACC GGTACGCCGC ACGCAATAGC GCTCTCCTAT
ATCGTCGATC CTGCCGGAGC GATCAGCGCA TTAAAGAAAC TGAAACAGGC TTATCCGAAT
ATCGAATCCC CGTATGGCTG GTATGACGCT GTCGATAGCA GCGGCAAGAT CTCAAAAAAT
ATCCTTTCCC TTGATGTCGG CATGTTTGTT GGTGCTTTTC TGGCGAAAGA GATCAATGCC
GATGTTGAAA AATACCTACA AAGCAAGGGC GATATGGAAT TGCTAAAAGA GATGTATCAG
TCCTACGTTC CCAATAATTA CAAACCATTG GATGGTCTCT CCAGCTCTTC TCTGCACTGA
 
Protein sequence
MINKRKIVLA AILLTVNGGL FAQSVVIDQL KVSEHLYPKG FESEFQNNLD FYRDGVGVDE 
KAKVPYDTIR IEGTEIVKGY YTNTTEVGLY LNILTESVKA GNLQALQRIK ETLTTLEQAP
KWNGLFYWPY DIRDGKLVTN PDEIVPAVDN GNLSFALAGV AGAFLDSSDA DKQEIVQRIE
AILDGQKPGW AALYDENKGL LSSGWSTKNN ASLGYFVDRK GNESRAAVAW AVLATKDMGA
KALPVSAFSK MELYTQRYEI NGKQYNPLLT WDGAYFQMMM PQIWLNEREL MPNYGIVEDH
TFIQKVYASK HGIPMVSSSA TTDNAYHAFG VPQLSESKVR FKNKIDDGYT GTPHAIALSY
IVDPAGAISA LKKLKQAYPN IESPYGWYDA VDSSGKISKN ILSLDVGMFV GAFLAKEINA
DVEKYLQSKG DMELLKEMYQ SYVPNNYKPL DGLSSSSLH