Gene YpAngola_A3362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3362 
Symbol 
ID5801839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3577903 
End bp3578985 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content48% 
IMG OID641341183 
ProductLacI family sugar-binding transcriptional regulator 
Protein accessionYP_001607705 
Protein GI162420065 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGAA AGCTAAAAAT ACAAGAAATA GCTAATCAAA CGGGTCTTTC GATTAGCACG 
GTATCAAGAG TGTTAGCGGG TAAAGGGAAT ACCAGCGCCA AAGCGAAACA GCAGGTCATG
GCGTATGCGA AATCGCAGGG GATCTTGCAG AATTTATCCA GCGGGCGGTT GATGTTGAAT
AATATTATGG TGTTTGCTCC CCATCGCGCT TTTGATGTAC GAACAGATAT TTTTTACTAC
AAAGTCATAC AGGGGATCAC TCAGGCGGTT AGCCAACATG AAGTGATGAT CCGGTACTGC
GGGTTATCCG AAACCCACAG TGATATTTCG CTATTCTTAG ATAAAATGAC CCACCCACAG
AGTGAGGCGG CGATTATTAT TGGCATTGAT GATCCACGGA TCCATGCCTT GGCCGCCAGT
CTCCATAAGC CAGCGGTATT AATTAATTGT CGCGATAAAG AAATGTCACT GGATAGCGTT
TCTCCTGACC ACCAATCAAT TGGCGAGTTT TCTGCCCATT ACCTGATACA GCAAGGGCAT
CGCCGTATTC TAACGCTACA GTGTTTGCGT CGTAATACCA TGGAGTTGCG GTTGCTGGGC
ATTAAAGACG CGTTTGCCAG TAATAATATG CGCTTCGATG ATCATCAGCA CCTGATCACC
ACTCATGGGT TTGGCGCTGA AGAGGCGGAG CAGGCGATAA CCACATTTTT CACAGCCTGT
GAGGATGAAA GCCGGCTGCC GACGGCAATT TTGGTCGGTG GCGACTATAT GGCTGTTGGC
GCGGTTAACG CGCTGAATAA ACTCAACGTG AATGTGCCCA ACAGGGTGTC AGTCATGAGT
ATGGACGGCT TTAACTTGGC GGAAATCCAT GATGTTCCAC TGACCGCGGT GCATGTTCCT
CGTGATGAAC TGGGGGCAGA GGCTATCCAA TTGTTACAGC GGCGGGTATT ACGCCCGGAT
GCACCGTTTA GCAACTTATT ATTGCAGGGT AAGCTGGTGG TTCGCTCCTC GGTAAAGCAG
GTCAATCAGA ATAGGGCGAT GCCTGCAGCA AATAAGCCGA CGGATCAGAT CTATGATTTA
TAA
 
Protein sequence
MNGKLKIQEI ANQTGLSIST VSRVLAGKGN TSAKAKQQVM AYAKSQGILQ NLSSGRLMLN 
NIMVFAPHRA FDVRTDIFYY KVIQGITQAV SQHEVMIRYC GLSETHSDIS LFLDKMTHPQ
SEAAIIIGID DPRIHALAAS LHKPAVLINC RDKEMSLDSV SPDHQSIGEF SAHYLIQQGH
RRILTLQCLR RNTMELRLLG IKDAFASNNM RFDDHQHLIT THGFGAEEAE QAITTFFTAC
EDESRLPTAI LVGGDYMAVG AVNALNKLNV NVPNRVSVMS MDGFNLAEIH DVPLTAVHVP
RDELGAEAIQ LLQRRVLRPD APFSNLLLQG KLVVRSSVKQ VNQNRAMPAA NKPTDQIYDL