Gene YpAngola_A3642 details


Gene Information

Locus tag: YpAngola_A3642
Symbol:
ID: 5802119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 3863055
End bp: 3864560
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 47%
IMG OID: 641341453
Product: RmuC domain-containing protein
Protein accession: YP_001607965
Protein GI: 162419667
COG category: [S] Function unknown
COG ID: [COG1322] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
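
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in Python. This is a minimal sketch, not the database's own method: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names gene_length_bp and protein_length_aa are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3863055, 3864560)
    print(length, "bp")
    print(protein_length_aa(length), "aa")
```

Under these assumptions the record's coordinates give the listed 1506 bp, and 1506 bp gives the listed 501 aa.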


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGATATCA GTTTATTTTA TGGGCTGGGT GGCTGTTTGG TCGGGGGGCT GATCGGGTGG 
CTAATTGCCA GTTTGTCCCA GCAGCGAACA AAAGCACAGC AAGATATTGA ACGGCGATTA
CTGGAGCAGG CATTACAGCA GGCACAACAG AGTATTGCTA CATTGCAGAT GACTCAACAA
CGTAATGAGC AACAGTTACG GCAAAGTGAG TTGGAACAAC GTAATTTGCA TAGCCAATTA
GCGGCAAATA GCGAAAAATT ACAGCAGCTT GCTCATTGGC GTAATGAATG TGAACAGCTT
AATCAAGAAT TGCGGGCTCA AAGGGAAATC AATAGTGCCC AAGAAGCAGA ATTACGTGAA
GTCACCATCC GCTTAGAAGA AACTCGCTTA GCCACAGAAG AAAAGCAGCG TTTATTACTT
AATAGTGAAC AGCGACTGAC CACTCAGTTT GAGAATTTGG CAAACCGCAT CTTTGAGCAA
ACGGGGCGTC GGGCTGATGA ACAGAATAAG CAAAGCTTAG ATCGCCTGCT ATTACCTTTA
CGTGAACAGC TTGATGGCTT TCGTCGTCAG GTTCAGGATA GCTTTGGGCA AGAAGCTCGG
GAGCGTCATA CTCTGACTCA CGAAATTCGG AACCTGCAAC AATTGAATGC CCAGATGGCG
CGAGAGGCAT TGAATCTGAC CAAGGCTTTG AAAGGGGATA ATAAAACACA GGGTAACTGG
GGGGAAGTCG TTCTGGCCAA AGTGCTTGAA GCCTCAGGGT TACGTGAGGG GCATGAATAT
CAAACACAGG TTAGCGTTAA AATTGACCAA ACCAGTCGTA TGCAGCCGGA TGTGATTGTT
CGATTACCCC AAGGCAAAGA TGTCGTTATT GACGCTAAAA TGTCATTGGT TGCCTATGAA
CGTTATTTTA ATAGCGAAGA TGATGCTGAA CGTGAGGTTG CGTTGAATGA ACATTTATCA
TCATTACGTG GCCATATCAG GATGCTGGGG CGTAAAGATT ACCAGCAACT CCCCGGTTTA
CGCTCCCTCG ATTATGTGTT GATGTTTATC CCGGTAGAAC CTGCATTTTT GGTGGCTATT
GACCGCCAGC CTGAGTTGAT CAACGAAGCA CTACAACACA ATATCATGTT GGTCAGCCCT
ACCACATTAC TAGTGGCTTT GCGCACTATC ACCAATTTGT GGCGCTACGA GCATCAAAGC
CAAAATGCGC AACGTATTGC TGAAAGAGCC GCCAGACTTT ATGACAAAGT ACGCTTGTTT
GTTGATGATA TGGCGTCTTT GGGGCAAAGC CTCGATAAAG CACAGCTAAG CTATCACCAG
GCAATGAATA AACTGTCCCA GGGCCGTGGT AACCTAGTTG GGCAAGTAGA GAGTTTTCGC
ACTCTAGGGG TCGAGGTAAA GCGGCCTATT AGCCCATTAC TGGCAGAAAA AGCCTGCGCG
GAGCATCAAC CTGAGGGTGA TTTAGCGCTA TCCGATGACG CAGAATCAGG GGCATTTCCA
GAGTAA
 
Protein sequence
MDISLFYGLG GCLVGGLIGW LIASLSQQRT KAQQDIERRL LEQALQQAQQ SIATLQMTQQ 
RNEQQLRQSE LEQRNLHSQL AANSEKLQQL AHWRNECEQL NQELRAQREI NSAQEAELRE
VTIRLEETRL ATEEKQRLLL NSEQRLTTQF ENLANRIFEQ TGRRADEQNK QSLDRLLLPL
REQLDGFRRQ VQDSFGQEAR ERHTLTHEIR NLQQLNAQMA REALNLTKAL KGDNKTQGNW
GEVVLAKVLE ASGLREGHEY QTQVSVKIDQ TSRMQPDVIV RLPQGKDVVI DAKMSLVAYE
RYFNSEDDAE REVALNEHLS SLRGHIRMLG RKDYQQLPGL RSLDYVLMFI PVEPAFLVAI
DRQPELINEA LQHNIMLVSP TTLLVALRTI TNLWRYEHQS QNAQRIAERA ARLYDKVRLF
VDDMASLGQS LDKAQLSYHQ AMNKLSQGRG NLVGQVESFR TLGVEVKRPI SPLLAEKACA
EHQPEGDLAL SDDAESGAFP E