Gene YpAngola_A4049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4049 
Symbol 
ID5802528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4307865 
End bp4308941 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content51% 
IMG OID641341833 
Producthypothetical protein 
Protein accessionYP_001608340 
Protein GI162420587 
COG category[R] General function prediction only 
COG ID[COG0795] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000497834 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones61 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGGTG TATTAGACCG CTATATCGGA CGGACTATCC TCAATACTAT CCTGATGACG 
TTATTAATGT TGGTGTCGCT GTCGGGCATC ATCAAGTTTG TCGATCAACT GCGTAAAGTC
GGGCAGGGGG ACTACTCGGC GGCCTCTGCG GGTATGTACA CTATCCTGAG CATCCCAAAG
GACATCGCGG TTTTCTTCCC GATGGCGGCC CTCTTAGGGG CATTACTGGG GCTTGGGACT
TTAGCCAGTC GCAGTGAGTT GGTAGTTATG CAAGCGTCAG GTTTTACCCG GATGCAAATC
GCAGCGTCAG TGATGAAAAC GGCAATCCCT CTGGTGTTGC TGACGATGGC TATCGGTGAG
TGGGTGGCAC CGCAAGGTGA GCAGACCGCG CGTAATTTCC GGACACAGCA GATGTACGGT
GGTTCGTTAC TCTCAACTCA GTCGGGTTTA TGGGCGAAAG ATGGCTCTGA CTTTATTTAT
ATTCAGCGGG TGTCTGGCGA AAGCGAGTTG ACGGGTGTCA ATATTTATCA TTTTGATAAA
GAAGATCGTC TGCTTTCGGT GCGGTATGCG GCGACGGCGA CCTATGAAAA AGACAATAAA
ACCTGGCGGT TATCGCAGGT CGATGAATCT GATTTAAGTA ATCCTACTCA GGTGACAGGT
TCACAGACGC TGACCGGCGA GTGGAAGACC AATCTGACGC CTGAGAAGTT GGGTGTGGTG
GCGATGGATC CAGATTCGCT CTCCATTAGC GGGTTGCACG ACTACAGTAA ATATCTACAG
CAAAGTGGCC AAGAGTCTAA CCGCTACGAA CTGAGTATGT GGAGCAAGGT ATTTGCTCCC
TTCTCTGTTG CGGTCATGAT GCTGATGGCG CTGTCGTTTA TTTTTGGCCC ATTGCGCAGC
GTGCCAATGG GTGTCCGGGT GGTCACCGGT ATTTTCTTCG GCTTTGTTTT CTACGTGCTG
GATCAAGTTT TTGGTCGACT TAGCTTGGTT TATGGCATCC CACCAATGCT GGGTGCGCTG
TTGCCGAGTA TGTTATTCCT CCTGATCAGC ATTTGTTTGC TGCTAAAACG GCGGTAA
 
Protein sequence
MFGVLDRYIG RTILNTILMT LLMLVSLSGI IKFVDQLRKV GQGDYSAASA GMYTILSIPK 
DIAVFFPMAA LLGALLGLGT LASRSELVVM QASGFTRMQI AASVMKTAIP LVLLTMAIGE
WVAPQGEQTA RNFRTQQMYG GSLLSTQSGL WAKDGSDFIY IQRVSGESEL TGVNIYHFDK
EDRLLSVRYA ATATYEKDNK TWRLSQVDES DLSNPTQVTG SQTLTGEWKT NLTPEKLGVV
AMDPDSLSIS GLHDYSKYLQ QSGQESNRYE LSMWSKVFAP FSVAVMMLMA LSFIFGPLRS
VPMGVRVVTG IFFGFVFYVL DQVFGRLSLV YGIPPMLGAL LPSMLFLLIS ICLLLKRR