Gene B21_00733 details

Gene Information

Locus tag: B21_00733
Symbol:
ID: 8114330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 770720
End bp: 772321
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 58%
IMG OID: 644847000
Product: hypothetical protein
Protein accession: YP_002998573
Protein GI: 251784269
COG category: [R] General function prediction only
COG ID: [COG5511] Bacteriophage capsid protein
TIGRFAM ID: [TIGR01539] phage portal protein, lambda family
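
The start, end, gene length, and protein length listed above are mutually consistent. The following minimal Python sketch shows the arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(770720, 772321)
print(length)                  # 1602
print(protein_length(length))  # 533
```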


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACGC CCACCATTCC CACCCTTCTG GGGCCGGACG GCATGACATC GCTGCGCGAA 
TATGCCGGTT ATCACGGCGG TGGCAGCGGA TTTGGAGGGC AGTTGCGGTC GTGGAACCCA
CCGAGTGAAA GTGTGGATGC AGCCCTGTTG CCCAACTTTA CCCGTGGCAA TGCCCGCGCA
GACGATCTGG TACGCAATAA CGGCTATGCC GCCAACGCCA TCCAGCTGCA TCAGGATCAT
ATCGTCGGGT CTTTTTTCCG GCTCAGTCAT CGCCCAAGCT GGCGCTATCT GGGCATCGGG
GAGGAAGAAG CCCGTGCCTT TTCCCGCGAG GTTGAAGCGG CATGGAAAGA GTTTGCCGAG
GATGACTGCT GCTGCATTGA CGTTGAGCGA AAACGCACGT TTACCATGAT GATTCGGGAA
GGTGTGGCCA TGCACGCCTT TAACGGTGAA CTGTTCGTTC AGGCCACCTG GGATACCAGT
TCGTCGCGGC TTTTCCGGAC ACAGTTCCGG ATGGTCAGCC CGAAGCGCAT CAGCAACCCG
AACAATACCG GCGACAGCCG GAACTGCCGT GCCGGTGTGC AGATTAATGA CAGCGGTGCG
GCGCTGGGAT ATTACGTCAG CGAGGACGGG TATCCTGGCT GGATGCCGCA GAAATGGACA
TGGATACCCC GTGAGTTACC CGGCGGGCGC GCCTCGTTCA TTCACGTTTT TGAACCCGTG
GAGGACGGGC AGACTCGCGG TGCAAATGTG TTTTACAGCG TGATGGAGCA GATGAAGATG
CTCGACACGC TGCAGAACAC GCAGCTGCAG AGCGCCATTG TGAAGGCGAT GTATGCCGCC
ACCATTGAGA GTGAGCTGGA TACGCAGTCA GCGATGGATT TTATTCTGGG CGCGAACAGT
CAGGAGCAGC GGGAAAGGCT GACCGGCTGG ATTGGTGAAA TTGCCGCGTA TTACGCCGCA
GCGCCGGTCC GGCTGGGAGG CGCAAAAGTA CCGCACCTGA TGCCGGGTGA CTCACTGAAC
CTGCAGACGG CTCAGGATAC GGATAACGGC TACTCCGTGT TTGAGCAGTC ACTGCTGCGG
TATATCGCTG CCGGGCTGGG TGTCTCGTAT GAGCAGCTTT CCCGGAATTA CGCCCAGATG
AGCTACTCCA CGGCACGGGC CAGTGCGAAC GAGTCGTGGG CGTACTTTAT GGGGCGGCGA
AAATTCGTCG CATCCCGTCA GGCGAGCCAG ATGTTTCTGT GCTGGCTGGA AGAGGCCATC
GTTCGCCGCG TGGTGACGTT ACCTTCAAAA GCGCGCTTCA GTTTTCAGGA AGCCCGCAGT
GCCTGGGGGA ACTGCGACTG GATAGGCTCC GGTCGTATGG CCATCGATGG TCTGAAAGAA
GTTCAGGAAG CGGTGATGCT GATAGAAGCC GGACTGAGTA CCTACGAGAA AGAGTGCGCA
AAACGCGGTG ACGACTATCA GGAAATTTTT GCCCAGCAGG TCCGTGAAAC GATGGAGCGC
CGTGCAGCCG GTCTTAAACC GCCCGCCTGG GCGGCTGCAG CATTTGAATC CGGGCTGCGA
CAATCAACAG AGGAGGAGAA GAGTGACAGC AGAGCTGCGT AA
 
Protein sequence
MKTPTIPTLL GPDGMTSLRE YAGYHGGGSG FGGQLRSWNP PSESVDAALL PNFTRGNARA 
DDLVRNNGYA ANAIQLHQDH IVGSFFRLSH RPSWRYLGIG EEEARAFSRE VEAAWKEFAE
DDCCCIDVER KRTFTMMIRE GVAMHAFNGE LFVQATWDTS SSRLFRTQFR MVSPKRISNP
NNTGDSRNCR AGVQINDSGA ALGYYVSEDG YPGWMPQKWT WIPRELPGGR ASFIHVFEPV
EDGQTRGANV FYSVMEQMKM LDTLQNTQLQ SAIVKAMYAA TIESELDTQS AMDFILGANS
QEQRERLTGW IGEIAAYYAA APVRLGGAKV PHLMPGDSLN LQTAQDTDNG YSVFEQSLLR
YIAAGLGVSY EQLSRNYAQM SYSTARASAN ESWAYFMGRR KFVASRQASQ MFLCWLEEAI
VRRVVTLPSK ARFSFQEARS AWGNCDWIGS GRMAIDGLKE VQEAVMLIEA GLSTYEKECA
KRGDDYQEIF AQQVRETMER RAAGLKPPAW AAAAFESGLR QSTEEEKSDS RAA
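
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this in plain Python, along with a simple GC percentage calculation. It assumes the sequence contains only A, C, G, and T, and it uses the standard amino-acid assignments, which translation table 11 shares for all 64 codons (table 11 differs only in its permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGAAAACGC CCACCATTCC CACCCTTCTG"))  # MKTPTIPTLL
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked against the nucleotide data.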