Gene B21_03470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03470 
Symbolybl162 
ID8116263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3705206 
End bp3706276 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content44% 
IMG OID644849642 
Producthypothetical protein 
Protein accessionYP_003001215 
Protein GI251786911 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAA AACTAAAGAT TGCTGATATT GCCATGCGAA CAGGCCTTTC ATCCAGCACC 
GTGTCACGGG TTTTAGCGGG TAAAGCTAAT ACCAGTTATC GTGCCCGCGA GAAAGTATTA
GCGTGTGCCA GAGAGCTTGG GGTTATGGAT GGAATGGCAT CCGGCCGTAT GTTGTTAAAC
AACCTTGTCA TTTTTGCTCC TCAACGGGCT TTTGATGAGC GAACAGACAT TTTTTATTTC
CGCGTAATCC AAAGCATCAG CAAGGCATTA TCGCATTATG AGGTTCGGTT GCGTTACTGT
GCCCTTGATG AGTTCGATAG TACACCTTCA AAATTCCTGG CGCGCATGAA TGAGGCAGAA
ACACAGGCTG CAATATTGCT TGGTATTGAT GATCCTCATA TACATGATCT TGCAGCAGAT
TTTTCTAAGC CGTGCGTAAT GATTAACTGC CATGATCGAC GTATGCGTCT TCCCACAGTT
GCTCCAGATC ACAAAAACAT TGGTGCTTTT GCGTCTCACT TTCTGTTTGA AATGGGGCAT
CGCAGAGTAA TGAACATTAT GTGTTTACGT CGATACACAA TGGAACTGCG ACTTGCCGGA
ATTAAAGAAG CATGGGAGAG ACAGAATCAG AACTTTAAAG ATGACAGAGA TTTACTCACT
ATTAATAGTT TCAGCGCCAA AGAAGCCGAA GAAAAAGTCG GAGCATGGCT GGATATGACA
GAGAAAAGTA TGCTACCGAC TGCTTTTTTG GCAAGTGGGG ACTTCATTGC TGCAGGTATA
ATTAACGCTT TAAAAAAACG TAATATACGA GTGCCACAAG ATGTGTCCGT AATGAGTATT
GATGGGTTTA ATCTGGCTGC AATAGAAGAT GTGCCACTTA CGGCAGTACA TGTACCTCGC
GATGAACTGG GTACTGAAGC GGTACATATG TTACAACAAA GACTTGTACG TCCTGATGCA
ACTGTTGGTG CATTACTTTT GTATGGGAAA CTGGTAATAC GAGAATCTGT TCGCCGTATA
CGGCCAGGGA AAGAGCCAAC TCCGATTAAA GGAGATGGAC TGTATGACTG A
 
Protein sequence
MEKKLKIADI AMRTGLSSST VSRVLAGKAN TSYRAREKVL ACARELGVMD GMASGRMLLN 
NLVIFAPQRA FDERTDIFYF RVIQSISKAL SHYEVRLRYC ALDEFDSTPS KFLARMNEAE
TQAAILLGID DPHIHDLAAD FSKPCVMINC HDRRMRLPTV APDHKNIGAF ASHFLFEMGH
RRVMNIMCLR RYTMELRLAG IKEAWERQNQ NFKDDRDLLT INSFSAKEAE EKVGAWLDMT
EKSMLPTAFL ASGDFIAAGI INALKKRNIR VPQDVSVMSI DGFNLAAIED VPLTAVHVPR
DELGTEAVHM LQQRLVRPDA TVGALLLYGK LVIRESVRRI RPGKEPTPIK GDGLYD