Gene YpAngola_A4083 details


Gene Information

Locus tag: YpAngola_A4083
Symbol:
ID: 5802562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 4349193
End bp: 4350320
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 47%
IMG OID: 641341862
Product: putative carbohydrate diacid regulator
Protein accession: YP_001608368
Protein GI: 162418234
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3835] Sugar diacid utilization regulator
TIGRFAM ID:
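
The start, end, and length fields of this record are related by simple arithmetic. A minimal Python sketch of that consistency check follows. The variable names are illustrative only, and the sketch assumes that the coordinates are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length.

```python
# Values copied from the record above; variable names are hypothetical.
start_bp = 4349193
end_bp = 4350320
gene_length_bp = 1128
protein_length_aa = 375

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = span // 3

print(span, span % 3, codons - 1)  # prints: 1128 0 375
```

Under these assumptions the span equals the listed gene length, it is a whole number of codons, and the codon count minus the stop codon equals the listed protein length.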


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.329455
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAACCA GCTATCTTAA GGAAGACACC GCTCGCCAAA TCGTTCAGCG CACCATGAGC 
ATTATTGATT ACTCAGTTAA TGTCATGAAT GAATACGGCG TCATTATTGC GTCTGGTGAT
CCTCGTCGTT TACATCAGCG TCATGAAGGT GCCGTTTTAG CGCTGACGGA AAACCGGATG
GTCATGATCG ACGAAGCCGT GGCTGGCAGG CTTAAAGGCG TTAAACCAGG AATTAACCTG
CCGATTATTT TTCGTCAGCG AATGGTAGGC GTTATTGGGA TTTCGGGTGA ACCCGATAAA
GTGCAGGCCT ATGCTGAACT GGTGCGCATG GCGGCGGAGC TGATTTTAGA ACAAGCTGAA
ATGTTAGAGC AAAATCAGTG GGAAAAACGC TATCGCGAAC AGTTAACCTC TCAGCTCATT
GCTCAGCAAG GCAGTGCGGC TTCAGTGGCA TCGATGGCAG CCTATTTAGG ATTAGACCTC
ACACTGCCAC GCATTGCGCT GATCGTCAGT CTGCGTGAAC AAGAGCACAC TCAGCAGCGG
GATCTGGTCG AAATATTAAG CAACCACAGC CGTGAAACAT TGGTCACCTT ACATGGGTTT
GAGCGGGTTG TGGTGCTATT GCCGCTGAAC CTTGCACACA GTGACGATCG TGAAGTGACG
GCCAGAAAGG CTTTGCGCAA ATTGTATGCG CTGCTACAGC CGCGTTTTAG TCTGAATATT
TATGTTGGTG GCATTTTTGA TGGCCTGGAT GGGCTGCAAC GCTCCTATCT TAGCGCACAA
GCGATGCAAG AAACGGCATT GCGACAAAAG CTCACGCAAC CGGTCATGTT CTATCAAGAC
CATTTTTTGA CCGTTCTTCT GAATGATTTT GCCGGGAGTT GGCAGGCTGG TGAACTGAGC
TCTGCCTGGC AAACACTGCG TCAGGCAGAT ACCAAAGACG TTCTGTGTCA CACTCTGCGT
TGTTACTTTG TACAGAATTG TGATCTGTCT CAAACGTCAA AACAACTGCA TATCCATGTC
AATACCCTAC GCTATCGGTT ACAGAAGATT GAAGCTATAA CAGCTTTAAA AATCAATGAA
TTAAGTTCAA TTATTCAGCT CTATATTGGC ATGAAAATAG AGAGATAA
 
Protein sequence
MPTSYLKEDT ARQIVQRTMS IIDYSVNVMN EYGVIIASGD PRRLHQRHEG AVLALTENRM 
VMIDEAVAGR LKGVKPGINL PIIFRQRMVG VIGISGEPDK VQAYAELVRM AAELILEQAE
MLEQNQWEKR YREQLTSQLI AQQGSAASVA SMAAYLGLDL TLPRIALIVS LREQEHTQQR
DLVEILSNHS RETLVTLHGF ERVVVLLPLN LAHSDDREVT ARKALRKLYA LLQPRFSLNI
YVGGIFDGLD GLQRSYLSAQ AMQETALRQK LTQPVMFYQD HFLTVLLNDF AGSWQAGELS
SAWQTLRQAD TKDVLCHTLR CYFVQNCDLS QTSKQLHIHV NTLRYRLQKI EAITALKINE
LSSIIQLYIG MKIER
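
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The Python sketch below is one way to do that, not the method used to produce this record. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names `clean` and `translate` are hypothetical, and only the first line and the last two groups of the gene sequence are used, to keep the example short.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
# Assumption: translation table 11 assigns the same amino acids as this table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0; '*' marks a stop codon. Trailing partial codons are ignored."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and the first two groups of the protein sequence.
first_gene_line = "ATGCCAACCA GCTATCTTAA GGAAGACACC GCTCGCCAAA TCGTTCAGCG CACCATGAGC"
first_protein_groups = "MPTSYLKEDT ARQIVQRTMS"

# Last 18 bases of the gene sequence (in frame, since 1128 - 18 is divisible by 3).
last_gene_bases = "ATGAAAATAG AGAGATAA"

print(translate(first_gene_line))   # prints: MPTSYLKEDTARQIVQRTMS
print(translate(last_gene_bases))   # prints: MKIER*
```

Both excerpts agree with the listed protein sequence: the first 20 residues match, and the final codons give MKIER followed by a stop.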