Gene YpAngola_A2134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2134 
Symbol 
ID5800604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2231666 
End bp2233075 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content49% 
IMG OID641340042 
ProductYHS domain-containing protein 
Protein accessionYP_001606587 
Protein GI162418256 
COG category[S] Function unknown 
COG ID[COG4393] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.129693 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTATT TTTTCATATC CGTACTACAA GCCTTTTTAC CTGTGGCTCT GTTATTGGGG 
CTGAACTGGG TTGTTCGGCC AGCACCAGTA CTGAATCGGA TAGTGTGGAT AACCATACTG
ATGGCCATCG TTGGCATCTG GATGGGTAAT TATTATCCAA AATCGCAACA GTGGCAGTTG
GCATTGGCGG GGATTCAACT CCTTTCACTA CTGTTATTCT TATGTAGCCA ATTTATATGT
CGGGTGTCAT TAGGCTATTT CTGGCAAGCA TTATTGGTCT TTGGTGCGGC ATTGAATTGG
GGCAACAACC CCAATCTAGG GGCGCTCACT AATACCCATG TTATTAATAC TGATTTATTG
CTCAATCTGG CTGCCACCGT GGTGGCATTT GGCTGGGTTA TATTCTGCGC CGTATTATTA
TTGATGATGG TACGACAGTT GCCACGTTGT CGCGGACCCT TGCTGGTCGC ACTCACCCTG
CTATTGATCT TGCCTATTAG CGGGGATGTC TTCCTGCTGC TTATGAAGTT ACACGTGGTG
CCGCTAACCA AATCACTCCT TAGCTACGTG GCTCTGGTGA CCAACGGGCA TGCATGGCTT
AACTATATTT GTGCCTTATT ATTAGCATTC ACGGTGCTGT GCTATTTGTG GCCGTGGAGC
CGGTCCCGCC ATGTGGTCAG CCAAACGTCA GAAGCCATTG CCAAACGTAA AGCGCTGGCA
GCGTACCGCA ATGTCCGGCG AATTTTGTTT TTATCGTTGC TGGCATTGGT GGTGGTCGCT
GCGGCTCAGT TTTATTGGGA TAAAGTGGCC TCGCAACCTC CTCAGTTATC AGAAGCATTG
CCCGTGACAC TGTCGTCTGA TGGGTTGGTC CATATCCCGG TTGAGCAAGT CCGAGATGGC
AAACTACACC GCTTTGTCTG GATCGCTGAT GACGGCAAAG CCGTTCGCTT CTTTATTATC
AACCGCTATC CAGACCGACT GCGCCTGAGT GTGGTCTTCG ACGCTTGCCT ATTGTGCGGC
GACCAAGGTT ATGTCATGGA AGGTAATCAG GTCATTTGTG TTGCTTGCGC GGTACATATC
TTCATTCCCT CCATCGGTAA AGCCGGTGGT TGTAACCCGA TACCTCTGGA AAATTGGCAG
AGTGATGATA ACGAGTTGAT TATTCCCAGA GCGTCTTTGG CGGCGGGCGT CAATTACTTT
ACGACGGTGG TCACACTGGA TGTCGTTGAT CCGGTTGATA AGAGCCATCT GACCAATCAA
AAATCCGAGT ATAAATACAG CTATGGCGGG AAAACCTATT TCTTCTCCTC CGAGGCGAAT
TACAACCGTT TCCGCGATCA CCCAGAACAG TTTGTCACGC CGGTAGCCGG CGAAGGTGAT
GCCAGCGATG ATAGACAGGA GAACCCATAA
 
Protein sequence
MSYFFISVLQ AFLPVALLLG LNWVVRPAPV LNRIVWITIL MAIVGIWMGN YYPKSQQWQL 
ALAGIQLLSL LLFLCSQFIC RVSLGYFWQA LLVFGAALNW GNNPNLGALT NTHVINTDLL
LNLAATVVAF GWVIFCAVLL LMMVRQLPRC RGPLLVALTL LLILPISGDV FLLLMKLHVV
PLTKSLLSYV ALVTNGHAWL NYICALLLAF TVLCYLWPWS RSRHVVSQTS EAIAKRKALA
AYRNVRRILF LSLLALVVVA AAQFYWDKVA SQPPQLSEAL PVTLSSDGLV HIPVEQVRDG
KLHRFVWIAD DGKAVRFFII NRYPDRLRLS VVFDACLLCG DQGYVMEGNQ VICVACAVHI
FIPSIGKAGG CNPIPLENWQ SDDNELIIPR ASLAAGVNYF TTVVTLDVVD PVDKSHLTNQ
KSEYKYSYGG KTYFFSSEAN YNRFRDHPEQ FVTPVAGEGD ASDDRQENP