Gene B21_01311 details

Gene Information

Locus tag: B21_01311
Symbol: tyrR
ID: 8114821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1372493
End bp: 1374034
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 52%
IMG OID: 644847559
Product: hypothetical protein
Protein accession: YP_002999132
Protein GI: 251784828
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG3283] Transcriptional regulator of aromatic amino acids metabolism
TIGRFAM ID: [TIGR00229] PAS domain S-box
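
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function name `check_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate and length fields agree.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the gene length counts the stop codon.
    """
    span = end_bp - start_bp + 1           # inclusive coordinate span
    whole_codons = gene_length_bp % 3 == 0  # CDS should be a multiple of 3
    codons = gene_length_bp // 3            # includes the stop codon
    return (
        span == gene_length_bp
        and whole_codons
        and codons - 1 == protein_length_aa
    )


# Values copied from the Gene Information fields.
print(check_lengths(1372493, 1374034, 1542, 513))  # True
```

With these values the span is 1374034 - 1372493 + 1 = 1542 bp, which is 514 codons, or 513 amino acids plus one stop codon.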


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00331968
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCGTCTGG AAGTCTTTTG TGAAGACCGA CTCGGTCTGA CCCGCGAATT ACTCGATCTA 
CTCGTGCTAA GAGGCATTGA TTTACGCGGT ATTGAGATTG ATCCCATTGG GCGAATCTAC
CTCAATTTTG CTGAACTGGA GTTTGAGAGT TTCAGCAGTC TGATGGCCGA AATACGCCGT
ATTGCGGGTG TTACCGATGT GCGTACTGTC CCGTGGATGC CTTCCGAACG TGAGCATCTG
GCGTTGAGCG CGTTACTGGA GGCGTTGCCT GAACCTGTGC TCTCTGTCGA TATGAAAAGC
AAAGTAGATA TGGCGAACCC GGCGAGCTGT CAGCTTTTTG GGCAAAAATT GGATCGCCTG
CGCAACCATA CCGCCGCACA ATTGATTAAC GGCTTTAATT TTTTACGTTG GCTGGAAAGC
GAACCGCAAG ATTCGCATAA CGAGCATGTC GTTATTAATG GGCAGAATTT CCTGATGGAG
ATTACGCCTG TTTATCTTCA GGATGAAAAT GATCAACACG TCCTGACCGG TGCGGTGGTG
ATGTTGCGAT CAACGATTCG TATGGGCCGC CAGTTGCAAA ATGTCGCCGC CCAGGACGTC
AGCGCCTTCA GTCAAATTGT CGCCGTCAGC CCGAAAATGA AGCATGTTGT CGAACAGGCG
CAGAAACTGG CGATGCTAAG CGCGCCGCTG CTGATTACGG GTGACACAGG TACAGGTAAA
GATCTCTTTG CCTACGCCTG CCATCAGGCA AGCCCCAGAG CGGGCAAACC TTACCTGGCG
CTGAACTGTG CGTCTATACC GGAAGATGCG GTCGAGAGTG AACTGTTTGG TCATGCTCCG
GAAGGGAAGA AAGGATTCTT TGAGCAGGCG AACGGTGGTT CGGTGCTGTT GGATGAAATA
GGGGAAATGT CACCACGGAT GCAGGCGAAA TTACTGCGTT TCCTTAATGA TGGCACTTTC
CGTCGGGTTG GCGAAGACCA TGAGGTGCAT GTCGATGTGC GGGTGATTTG CGCTACGCAG
AAGAATCTGG TCGAACTGGT GCAAAAAGGC ATGTTCCGTG AAGATCTCTA TTATCGTCTG
AACGTGTTGA CGCTCAATCT GCCGCCGCTA CGTGACTGTC CGCAGGACAT CATGCCGTTA
ACTGAGCTGT TCGTCGCCCG CTTTGCCGAC GAGCAGGGCG TGCCGCGTCC GAAACTGGCC
GCTGACCTGA ATACTGTACT TACGCGTTAT GCGTGGCCGG GAAATGTGCG GCAGTTAAAG
AACGCTATCT ATCGCGCACT GACACAACTG GACGGTTATG AGCTGCGTCC ACAGGATATT
TTGTTGCCGG ATTATGACGC CGCAACGGTA GCCGTGGGCG AAGATGCGAT GGAAGGTTCG
CTGGACGAAA TCACCAGCCG TTTTGAACGC TCGGTATTAA CCCAGCTTTA TCGCAATTAT
CCCAGCACGC GCAAACTGGC AAAACGTCTC GGCGTTTCAC ATACCGCGAT TGCCAATAAG
TTGCGGGAAT ATGGTCTGAG TCAGAAGAAG AACGAAGAGT AA
 
Protein sequence
MRLEVFCEDR LGLTRELLDL LVLRGIDLRG IEIDPIGRIY LNFAELEFES FSSLMAEIRR 
IAGVTDVRTV PWMPSEREHL ALSALLEALP EPVLSVDMKS KVDMANPASC QLFGQKLDRL
RNHTAAQLIN GFNFLRWLES EPQDSHNEHV VINGQNFLME ITPVYLQDEN DQHVLTGAVV
MLRSTIRMGR QLQNVAAQDV SAFSQIVAVS PKMKHVVEQA QKLAMLSAPL LITGDTGTGK
DLFAYACHQA SPRAGKPYLA LNCASIPEDA VESELFGHAP EGKKGFFEQA NGGSVLLDEI
GEMSPRMQAK LLRFLNDGTF RRVGEDHEVH VDVRVICATQ KNLVELVQKG MFREDLYYRL
NVLTLNLPPL RDCPQDIMPL TELFVARFAD EQGVPRPKLA ADLNTVLTRY AWPGNVRQLK
NAIYRALTQL DGYELRPQDI LLPDYDAATV AVGEDAMEGS LDEITSRFER SVLTQLYRNY
PSTRKLAKRL GVSHTAIANK LREYGLSQKK NEE
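
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own method: the helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code, so the standard 64-codon layout is used here.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCGTCTGG AAGTCTTTTG TGAAGACCGA"))  # MRLEVFCEDR
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_fraction` can be compared against the GC content field.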