Gene YpAngola_A3249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A3249 
SymbolgalR 
ID5801725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp3448490 
End bp3449512 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content52% 
IMG OID641341077 
ProductDNA-binding transcriptional regulator GalR 
Protein accessionYP_001607599 
Protein GI162418841 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0000735815 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCACTA TAAAGGATGT TGCCAAGCTG GCGGGTGTTT CCGTCGCAAC GGTATCTCGT 
GTTATCAATC ATTCTCCCAA AGCCAGTGAA GCATCACGCG TGGCGGTGTG CAAGGCGATG
GAACAACTGC AATACCACCC GAATGCCAAC GCCCGAGCAC TGGCGCAACA ATCGACAGAA
ACGGTCGGTA TGATTGTGTC TGATGTCTCG GACCCTTTCT TCGGTGCGAT GGTGAAAGCC
GTCGAACAAG TCGCGTATGC CACCGGTAAT TTCCTGTTAA TTGGCAACGG TTACCATGAT
GCCGAAAAAG AACGTCAGGC CATCGAACAA CTCATTCGCC ACCGCTGCGC TGCGCTGGTG
GTACATGCCA AAAAATTACC CGATGACGAA CTGACGTCAT TAATGGAACA AATTCCTGGC
ATGGTGTTAA TTAACCGCAC CTTACCGGGC TTCGAACCCC GTTGTATCGC ATTAGATGAC
CGCTATGGTG CCTGGCTGGC AACTCGCCAT CTCATCCAGC AGGGGCATAA ACGGGTCGCG
TTCATTTGCT CCAATCATCA GATTTCCGAT GCGCTTGACC GGATGCAAGG CTATTTGGAT
GCGTTGAAAG AATTTGATAT CCCGGTTGAT GAGCGTTTAA TTACCTACGG CACCCCCGAC
GAACTCGGCG GTGAGCAGGC AATGACCGAT CTACTTGGCC GTGGTAAACA CTTCACCGCG
GTAAGCTGTT ATAACGACTC AATGGCGGCC GGGGCGTTAT CGGTTCTCAG TGATAACAGT
ATTGATGTGC CACAGGAGAT TTCACTCATC GGTTTTGATG ATGTATTAAT CTCCCGTTAC
CTGCGCCCAC GCCTGACGAC AATCCGTTAC CCAGTCGTTG CCATGTCTAC CCAGGCCGCT
GAGTTGGCAC TCGCATTAGC CAACAACACA CCGTTACCCG AAATTACCAA TATGTTCAGC
CCAACACTGG TTCGCCGCCA CTCTGTGGCC AGCCCGCCAA GCCTACGGGA TGACACTGAT
TAA
 
Protein sequence
MATIKDVAKL AGVSVATVSR VINHSPKASE ASRVAVCKAM EQLQYHPNAN ARALAQQSTE 
TVGMIVSDVS DPFFGAMVKA VEQVAYATGN FLLIGNGYHD AEKERQAIEQ LIRHRCAALV
VHAKKLPDDE LTSLMEQIPG MVLINRTLPG FEPRCIALDD RYGAWLATRH LIQQGHKRVA
FICSNHQISD ALDRMQGYLD ALKEFDIPVD ERLITYGTPD ELGGEQAMTD LLGRGKHFTA
VSCYNDSMAA GALSVLSDNS IDVPQEISLI GFDDVLISRY LRPRLTTIRY PVVAMSTQAA
ELALALANNT PLPEITNMFS PTLVRRHSVA SPPSLRDDTD