Gene YpAngola_A3494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3494 
Symbol 
ID5801970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3709397 
End bp3710488 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content46% 
IMG OID641341311 
Producthypothetical protein 
Protein accessionYP_001607824 
Protein GI162420414 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000556203 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.010619 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTTTTAG AACGCCGCCA CATTCAATTT AAAACCGCCA CATTCAAATT TAAAAATGGC 
CACCCGATGA AAAGAAAAAG CACCAAAGCA CTGATCCTTT TTGTTGTGAT CTGTTTGGGA
TTACTGCTGT TGGGTTATCA AAAAGTACAA GATTTTGCGC GTCAGCCCTT GGCGATTAAG
CAAGAGACTT ACTTTACCTT ACCGGCAGGT ACCGGGCGGG TTGCTCTAGA GAATTTATTG
CTACGTGATC ATGTGATTGC TAATACAGGT TTATTTCCCT GGCTACTGCG CATAGAACCC
GAACTGGCTA ATTTTAAGGC TGGGACATAT CGTTTTACGC CGGGTATGAC GGTACGTGAG
ATGCTGGAGT TGTTGGTCAG TGGTAAAGAA GCCCAGTTTA CCGTTCGTTT TATTGAAGGT
AAGCGCCTGC GTGATTGGCT GGATGAATTA CAACAGTCAA AATATATCAA ACATGTGCTG
GAAGGGAAAA CAGATGCTGA AATTGCGCAG TTACTGGGGT TAAAAGAGAG TGAACACCCC
GAAGGCTGGC TCTATCCTGA TACCTACTCC TATACCGCAG GTACAACGGA TTTAACACTG
CTCAAACGTG CCCATCAACG AATGGAGGAA ACCGTTGCAG AAATCTGGCA GGGGCGTGAT
GACGGATTGC CGTATAAGAC CCCAAGTGAT TTGGTTACTA TGGCATCAAT CATTGAAAAA
GAAACCGCGG TAAATGAGGA GCGGGATAAG GTGGCTTCTG TCTTCATTAA CCGTTTACGT
CTTGGTATGC GCCTACAAAC GGATCCAACA GTGATTTATG GTATGGGCGA AAAGTATAAC
GGCAACATTA CTCGTAAAGA CTTAGATACC CCGACACCTT ATAATACCTA TGTTATTTCC
GGTTTGCCGC CAACCCCCAT TGCCATGCCA GGGTTGGCAT CGCTGACCGC TGCTGCCCAT
CCGGCCCAAA CGCCCTATCT CTATTTTGTC GCAGACGGCA AGGGCGGGCA TACATTCACC
ACTAATTTAG CCAGTCATAA TCAAGCCGTG CGTGTTTATC GCCAGTCGCT AAAGGATAAA
AATGAACAGT AA
 
Protein sequence
MFLERRHIQF KTATFKFKNG HPMKRKSTKA LILFVVICLG LLLLGYQKVQ DFARQPLAIK 
QETYFTLPAG TGRVALENLL LRDHVIANTG LFPWLLRIEP ELANFKAGTY RFTPGMTVRE
MLELLVSGKE AQFTVRFIEG KRLRDWLDEL QQSKYIKHVL EGKTDAEIAQ LLGLKESEHP
EGWLYPDTYS YTAGTTDLTL LKRAHQRMEE TVAEIWQGRD DGLPYKTPSD LVTMASIIEK
ETAVNEERDK VASVFINRLR LGMRLQTDPT VIYGMGEKYN GNITRKDLDT PTPYNTYVIS
GLPPTPIAMP GLASLTAAAH PAQTPYLYFV ADGKGGHTFT TNLASHNQAV RVYRQSLKDK
NEQ